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1 TALK ABOUT TALK^ 

The lexicogrammar of a language represents the semantics cf that language, not as 
one half of a sign represents the other half, but as a whole systv^n of systems represents 
another system of systems: representation as it is built into the organization of natural 
languages is the symbolic relation we find or posit between different strata (levels) of 
linguistic organization. Consequently, this is the kind of representation the linguist is 
concerned with. But he/she must also be concerned w.ih representation as it is defined 
by linguistics itself, viz., the linguist's representation of natural language according to 
some theory of language. The issue here is how to represent language 
metalinguistically; how to talk about talk. 

The two tasks of representation are comparable in many respects. Indeed, we could 
turn linguistic theory back on itself and use the framework that has been devised for 
the study of language to understand and talk about metalanguage. For instance, we 
could try to do systemic metalinguistics in this way. What I will do in this paper is in 
fact metalinguistics, but it would take too long to try to situate my topic within a 
systemic metalinguistics here. It will, however, be useful to say something about the 
context. The central topic is the representation of lexicogrammar. 

The general purpose that has prompted the present exploration of representational 
issues is text generation. It is text synthesis and contrasts with text analysis, but 
both processes can be used to study text. It is the process of creating a text in response 
to a communicative goal (purpose). The process is specified in a text generation model 
(system). As a metalinguistic purpose for linguistics, text generation can be compared 
and contrasted with other purposes such as text understanding, text analysis, the 
grammatical tagging of a text corpus, literary interpretation, language education, 
contrastive analysis, and typological linguistics. These purposes are all, of course. 



In my work on representational issues, I have profited Trom discussions w^th a number of people, 
including Michael Halliday, Bill Mann, E>ob Kasper, Robin Fawcett, Peter Fries, Jay Lemke, and John 
Bateman, and participants in the Systemic Workshop in Ann Arbor, all of whom I am very grateful to. I 
am also greatly indebted to Susanna Gumming, Bob Kasper, and Lynn Poulton for helpful comments on a 
draft of this paper. All errors and oversights are my own. 
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systematically related as studies of language, but they also occur in different contexts 
and raise different demands, a.g., on the representation of the theory. For example, 
perspicuity is probably higher on the list of demands on grammatical representation in 
language education than in text generation. 

The computer is a highly useful tool in work on text generation - not necessary, but 
useful. It helps the linguist manage large bodies of data, such as a lexico-grammar, and 
it enables him/her to run actual tests of the model of text generation being built. 
There are several examples of systemic work on text generation or work using a 
cor putational grammar closely related to systemic ideas. In the early 1970s, Davey 
(1978) used a systemic grammar partly derived from Hudson's work (e.g., Hudson 
(1971)) in a text generation system. McKeown (1982; 1985) used a Functional 
Unification Grammar (^UG) in her TEXT system, and Appelt has included a FUG as 
one of the levels in his generation system (Appelt (1983)). FUG was developed by 
Martin Kay (see e.g., Kay (1979)) and is closely related to systemic functional grammar; 
the grammar and its relationships to systemic grammar are discussed in Bob Kasper's 
paper in this report. 

In 1980, work on a large systemic grammar of English for text gener?tion started at 
the Information Sciences Institute of the University of Southern California. It is called 
the Nigel grammar and is being developed as an integral part of a text generation 
syst-m. Bob Kasper is currently working on a version of Nigel to be used for parsing. 
John Bateman has started work on a computational systemic grammar of Japanese, also 
in the context of text generation (Bateman (1985)). Also, Robin Fawcett is developing a 
computational systemic grammar of English, implement'ng his contributions to systemic 
grammar (cf. Fawcftt (1980)). The present discussion is derived from the work on the 
Nigel grammar. For example, the constraints on realization operators to be discussed 
are constraints in the Nigel grammar. The framework of the Nigel grammar is intended 
to be as faithful as is currently possible to Michael Halliday's ideas. My paper is not, 
however, an introduction to the Nigel grammar; for brief presentations see e.g., Mann & 
Matthiessen (1985), Matthiessen (1983), and Mann (1983). 
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What is the context of text generation, particularly in computational work? (For a 
discussion of related issues, see also Fawcett's paper in this volvme.) The context can 
be characterized with the help of systemic theory, using field (the social action and 
subject matter, whi-o's going on), tenor (the social relationship, who's taking part), and 
mode (how are the meanings exchanged); see e.g., Halliday & Hasan (1985). 

In the field, processes are foregrounded. There must be dynamic accounts of 
traversing the system network, activating realization statements, etc. in addition to 
static accounts, in the text generation model; the two perspectives have been discussed 
by Martin (1985). Most descriptive work has been focused on the system network at 
rest and on structure as a static product of structure building. However, there are often 
two interpretations, a dynamic one and a static one. For example, systems can be read 
as instructions to choose under certain conditions (dynamic) or as statements about the 
availability of options (static\ 

In the tenor, explicitne?r and formality are emphasized since the user of the text 

generation model is a computer and not a knowledgeable linguist as in a lot of 

descriptive work. When there are gaps in the account, the computer cannot fill them 
in. It cannot clarfy ambiguous statements. 

The mode has to be algebraic rather than graphic; it has to be a mode that can be 
manipulated in computer programs. In descriptive work, the graphic mode is very 
valuable and has usually been favoured. Systems and system networks are represented 
by graphic diagrams; and structurtis are represented by box diagrams. 

Advances have been made in the work on Nigel in all three areas of the context so 
that more demands can be met. For example, more is known about the process of 
traversing a system network and about constituency ordering. However, the focus of 
this paper will be on representational issues that have been discovered and remain and 
not on these advances. Although I will discuss some approaches to solutions, this is 
essentially a paper about problems and not about solutions. Whenever we consider 
solutions, there is a high demand on them:, the representation should be as natural and 
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as iconic as possible with respect to the theory. Functional felicity is more important 
than generative power. 

Although I will try to mentioi the various aspects of systemic work the paper 
a<idresses, the paper presupposes some familiarity both with systemic theory and with 
systemic descriptions of English. 

2 THEORY AND REPRESENTATION: LAG 

There seems to be a general tendency for theory to precede the representation of it, 
which is, of course, not surprising. There is a lag between theory and representation, 
but the lag depends on the nature of the representation. Systemic linguistics has always 
allowed for a cline in the explicitness of the representation: diagrams and algebraic 
statements that can be interpreted formally coexist. This cline has pro' ed very useful, 
since different contexts place different demands on the representation. Aspects of the 
theory can be represented diagrammatically, although we do not yet know how to give 
them a representation explicit enough to be implemented computationally. (Cf. 
Martinet's distinction between visualization and formalization in Parret (1974).) For 
example, as we will see in Section 4.3, there is a graphic representation of the kind of 
structure generated by the textual metafunction, "pulsive" or "periodic" structure, but 
we do not yet know how to represent it in a formal algebraic mode. 

Linear recursion. An example of explicit representation lagging behind theory 
comes from the logical component of grammar. The theory of linear recursion had been 
developed by the mid 1960s (cf. for example Halliday (1965); Hudson (1971: 60-63)). A 
recursive system can be represented in an informal graphic way; see Figure 1. 
(Conventions are given in the Appendix. There is a system with two options 
represented by features, simElex vs. complex . If the feature 'complex' is chosen in the 
system 'simplex/ complex ', the entry condition to the system 'a/b' is satisfied, but there 
is also a "recursive" loop back to the system ' simplex / complex '. Associated with the 
feature complex is a realization statement (x -> y, given in the box under the feature 
complex), specifying a structure fragment. The meaning of it will be discussed in 
Section 4.4.) 



ERIC 



4 10 



simplex 

I — a 




Figure 1: Informal graphic representation of recursive system 

However, although the theory of linear recursion had been developed, the problem of 
the explicit representation of it had not been solved. Henrici worked with an explicit 
version of systemic representation. Comparing embedding recursion (recursion through 
rankshift) with linear recursion (iteration), he notes that the latter is "rather more 
troublesome" (Henrici (1966)). One problem is that the recursive system (i.e., a system 
with a loop back to its disjunctive entry condition) "would not be a conventional 
system, in that at least one of the terms in the system would also act ViS an entry 
condition tc the system, contrary to the partial-ordering imposed on ordinary systems". 
In other words, the explicit interpretation of the system representation as a partial 
ordering brought out the issue of what the recursive system means. We have had 
similar experience in our work on the Nigel grammar. 

In the case of linear recursion, the lag between theory and explicit representation is 
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still very evident. Davey did not implement recursive systems in his computational 
generation grammar; he writes {1978: 81): "It is clear that systemic functional grammar 
cannot produce coordinate items without some extension of its purely classificatory 
apparatus, but we do not feel confident that the right extension has been found." 
Similarly, McCord (1975: 211) discusses a modification of conventions to handle linear 
recursion. There is no generally accepted explicit interpretation of it for a formal 
representation, neither for the recursive system (see Section 5.1) nor for the structural 
output (see Section 4.4 ). 

Box diagrams and realization statements. The representation of structure in 
general is another example of the lag between theory and explicit representation, as we 
will see. There are various structural relationships (e.g., discontinuity, which I will 
discuss later) that do not pose a problem for the informal box au ?ram representation 
but prove to be a problem for explicit realization statements. 

The box diagram is like a tree diagram in that it represents a product rather than a 
process, the product of a structure specification (realization statement), but it is used 
instead of the tree diagram because it facilitates the representation of layering (see 
Figure 9). Examples of box diagrams can be found in Figures 3, 10 and 15. 

The representational problems I will identify and discuss are usually problems for a 
representation that is more explicit than a box diagram representation. I will make 
reference to Martin Kay's Functional Unification Grammar (FUG) and Bob Kasper's 
: aper in this report where this kind of grammar is presented and discussed as a source 
of possible solutions to the problems: FUG was designed to be explicit and forms 
enough for the computational tasks of generation and parsing. 

3 DIMENSIONS OF STRUCTURE 

All parts of a systemic grammar are ultimately interconnected. The grammar is a 
network of systems of options; each option may have one or more realization statements 
associated with it, serving to specify a structure fragment or to predetermine the choice 
of an option as the grammar is re-entered. Given this interconnectedness, we could 



choose various starting points in a discussion of representational issues, for example the 
representation of the net'w'ork of systems or the representation of structure. We could 
discuss the issues in terms of the grammar as a system at rest (the potential) or of the 
grammar as a process (the potential actualized). We could choose one of these various 
angles on lexicogrammar and still hope to arrive at u diagnosis of representational issues 
that is reasonably general. 

In this sectioL, I shall unravel the grammar by means of the various realization 
operators used in Nigel: conflate, insert, order, expand, and preselect. They serve to 
specify the structural output of the grammar; the representational issues will 
consequently be exemplified first in terms of the specification of structure. There are 
various reasoiis for starting with the realization operators. One is that they have been 
discussed and explored less than the system network representation. Another is that 
they are more restricted in representational applicability than the system network 
representation (as we vill see) and will serve to identify represencational issuer mo'^e 
clearly. 

3.1 Three dimensions of structure 

Structure is a composite object or process in which each fragment is specified by the 
r Jization operators. For the discussion of the realization operators, it is useful to 
think of structure as organized in te-ms of three dimensions (cf. Halliday 1961; 
Matthiessen 1985). The dimensions are manifested both in the system's potential for 
creating structures and in examples instantiating this potential. For each dimension we 
find one or more realizati<^"i operators: 

1. Structuring: insert, order, expand 

2. Metafunctional layering: conflate 

3. Rank: preselect 

Structuring and metafunctional layering define the structure of a unit such as a 
clause, a nominal group, or a verbal group as a configuration of functions. The 
functions are realized by units of the rank below, through preselection. (There is a 
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distinction between a structure, a configuration of functions, and a syn^agm, a sequence 
of classes such as nominal group and verbal group: see Hallid^y (1968). To keep the 
discussion manageable, I will not bring in the distinction, but will merely note here that 
it exists and could be used to address certain representational problems. The realization 
operator Preselect establishes the relation between a function in the function structure 
of a unit at one rank and a grammatical class represented as a feaiure at the rank 
below.) The realization operators to be discussed are as follows: 
Name of operator Description 



(Insert Subject) 



+Subject 



Function Inserted as 
constituent of the 
structure of the unit 
being specified 



(Order Subject Finite) SubJ ^ct^Flnlte One function ordered to 

precede another function 



(Expand Mood Finite) Mood (Finite) 



One function Is expanded 
to have another funclton 
as constituent 



(Conflate Subject 
Agent) 



(Preselect Subject 
singular ) 



Subject/Agent 



Subject: 
slnjgular 



One function Is conflated 
with anothei function to 
form part of the same 
constituent function bundle 



A function Is preselected for 
a feature; the realization 
of the function Is 
cc^stralned to display that 
feature 
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3.2 Sample grammar 

Before discussing the (Nigel) realization operators, I will present a fragment of the 
Nigel grammar so that I can refer to it later for illustrations: see Figure 2. The 
network fragment consists of systems belonging to MOOD, an interpersonal region in 
the clause grammar, to TRANSITIVITY, an experiential region in the clause grammar, 
and to VOICE, a textual region: 

MOOD systems: MOOD TYPE, INDIC[ATIVE] MOOD PERSON, INDIC[ATIVE] 
TYPE, INTER [ACTANT MOOD PERSON] TYPE, and [DECLARATIVE] 
SUBJ[ECT] PRES[UMPTION]) , 

TRANSITIVITY systems: AGENCY and AGENTIVITY, and 

VOICE svstem: VOICE. 
# 

Since it is only a partial fragment, the grammar in Figure 2 cannot be used to 
generate complete clause structures. Graphic conventions are given in the Appendix. 
For present purposes, the systems MOOD TYPE and AGENCY can be assumed to be 
simultaneous, i.e. reachable from the same entry condition, viz., the feature clause. 

Before the part of the grammar specified in the diagram above is entered, the 
following has occurred: the functions Predicator, Medium, Process, and Finite have 
been inserted; Predicator and Process have been conflated; Process has been preselected 
to be a verbal group; and Medium has been preselected to be a nominal group. 
Traversing the mood part of the network, we can choose the following path: 
System: feature chosen Realization sta*^<iment 



MOOD TYPE: Indicative +Subject, Subject: nominative, 

Mood(SuDject) 

INDICATIVE TYPE: declarative Subject ^ finite 



ERLC 



9 15 



INDICATIVE MOOD PERSON: 



Interactant 

INTERACTANT TYPE: speaker Subject: speaker, Finite: speaker 

SUBJECT PERSON: explicit 

Trave'^ing the two transitivity systems AGENCY and AGENTIVITY, we can choose 
AGENCY: effective Process: effective 

AGENTIVITY: agentlve, +Agent, Agent: ngp 

and in the VOICE system 
VOICE: operative Subject/Agent 

Vhe structure fragment that results from the application of the realisation statements 
is given in Figure 3. 

The structure above is a partial specification of examples like / have dissolved the 
parliament, I have frightened my poor friend, and / will open the gates. After this 
brief example of the system network, realization statement associated with features in 
it, and the function structure resulting from the application of realization statements, I 
will now turn to the first dimension mentioned in Section 3.1. 

3.3 Structuring 

Structuring is the dimension that defines syntagmatic organization within one rank 
and within one (metafunctioral) layer of structure. Grammatical regions make 
structural contributions such as transitivity structure (e.g.. Agent Process Medium: see 
Figure 3), mood structure (e.g., Subject Finite: see Figure 3), theme structure (e.g.. 
Theme Rheme), tense structure, and modification structu 3. 
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A .^s.^ ^ A . .-.Q 1.1.x V, 1.1. w wi ^iciiiiiiiar 



r" indicative 



MOOD 



TYPE 



+ Subject 
Subject: 

nominative 
Mood(Subject) 



imperative 



declarative 



INDIC. 



TYPE 




|— implicit 



*Subject 



^ mterrogative 



INDIC 



MOOD 
PERS 



r— mteractart 



other 



INTER, 



speaker 



Subject: 
speaker 

Finite: 
speaker 



TYPE |_ speaker-plus 
addressee 



L- explicit 



1— eTfective 



AGENCY 



Process: 
effective 



— middle 



Process: 
middle 



P- agentive 



AGENT- 



IVITY 




I— non-agentive 



Subject/ 
Medium 

Predicator: 
passive 



VOICE 



— operative 




receptive 



Subject/ 
Medium 

Predicator: 
passive 
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Agent 

ngp 




Process 

vgp 


Medium 

ngp 


Subject 
speaker 
nomin. 


Finite 
speaker 


Predicator 




Mood 



TRANSITIVITY 



MOOD 



Figure 3: Fragment of function structure 

The real'ization operators used to specify structuring are Insert, Order, and 
Expand. Their use will be illustrated with examr'es from the MOOD region of English 
grammar. The presentation of each operator is organized into a characterization of it, 
the result of its application, a specification of some constraints on it. and 
representational problems. 

3.3.1 Insert 

Characterization. The Insert operator is used with a grammatical function such as 
Subject, Agent, or Theme as its operand, e.g., (Insert Subject) or +Subject for short. It 
specifies the presence of the function in (adds the function to) the function structure 
being generated as a constituent of that structure. For example, in the specification of 
a clause, we may associate the realization statement +Subject with the grammatical 
feature indicative as in the grammar fragment above. This means that the function 
Subject is present in the structure of a clause as a constituent of ^he clause structure. 

Result. The result of the application of +Subject is simply a constituent branch and 
node in a tree c'iagram or a constituent box in a box diagram; see Figure 4. 
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Subject 



Subject 



(i) Tree diagram 



(ii)Box diagram 



Figure 4: +Subject — structural result in diagram 



Constraints. I will mention two important constraints on the Insert operator, one 
having to do with the status of an inserted function and one with its const! uUeu^y: 

1. When an inserted function is declared to be present, "present" means 
'ac'.ually present' and not 'potentially present' or 'present in the default 
case'. Once inserted, a function is, and remains, actually inserted. 

2. The fact that Subject is a constituent of clause structure means that it is 
available to other structure specifications within the clause but not outside 
the clause. It also means that it is not a constituent of any other unit, e.g., 
arankshifted (embedded) constituent clause. Furthermore, a clause cannot 
insert a function into e.g., a verbal group or a nominal group. 

Problem: implicit functions. Both constraints are well motivated in principle, but 
there are situations where they lead to some complex conditions in the grammar. 
Consider the first constraint. 

How do we deal with implicit functions, e.g., implicit Subjects? The function Subject 
is explicit, actually present, in almost all indicative clauses. Consequently, it is 
tempting to associate the realization statment +Subject with the feature indicative . If 
we do (as in system MOOD TYPE of the sample grammar in Figure 2), we will get 
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Subjects even where we do not want them, where they are implicit, as in /// haven't had 
dinner yet. 

There is no formal problem with adding to the conditions under which the statement 
+ Subject is applied so that Subject does not get inserted where it should be left 
implicit, but the added conditions do complicate the grammar a bit. The insertion of 
Subject can be stated in a gate whose disjunctive entry condition specifies under what 
conditions a clause may have an explicit Subject. (A gate has an entry condition like a 
regular system, but only one output feature/term /option.) The output is simply one 
feature, explicit subject , wHh which the realization statement +Subject is associated. 
This is illustrated for the sample grammar in the diagram below; cf. the gate for 
Subject insertion in Matthiessen (1985: Figure 9). (The grouping of the features of the 
entry condition is not formally significant.) 





explicit 

(declarative) 



explicit subject 



+ Subject 



interrogative 



Figure 5: Gate for inserting Subject 
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The complication is increased when we add the conditionality for functions such as 
Agent that may get confiated with Subject; see Section 3.4. 



One solution. There is one way of dealing with the problem which involves a re- 
interpretation of the Insert operator. Instead of defining its result as 'actually present', 
we can define it as 'potentially present', i.e. 'present in the default case, unless the 
function is declared not to be present'. We can then add a preventive operator, call it 
*, to be used to specify absence. Assume that Subject has been inserted, + Subject 
(MOOD TYPE: indicative). If the grammar states *Subject, the Subject, although 
potentially present, is not actually present (system DECLARATIVE SUBJECT 
PRESUMPTION: implicit). There would be two possible cases if a function has been 
inserted: 

+ Subject ^potential presence* 

[I] * Subject 'not actually present" 

[II] 'actually present, by default* 

Potential presence can be compared with presence of a function in Fawcett's starting 
structure (described in Fawcett (1980)). 

3.3.2 Order 

Characterization. The operator Order is applied to two functions, e.g., (Order 
Subject Finite) or Subject " Finite. It introduces a precedence relation between the two 
functions in realization sequence. There are two varieties of this operator, one that only 
specifies the precedence relation and one that also specifies contiguity so that no other 
constituent can intervene. I shall not distinguish them here. (I shall also omit 
OrderAtEnd aaJ OrderAtFront; cf. Mann & Matthiessen (1984) and Matthiessen 
(1985).) 

The realization operators that relate two functions or a function and a feature specify 
relations that can be classified in terms of Halliday's theory of relational transitivity 
(Halliday 1985). I will not elaborate on this typology of realization operators here, but 
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will indicate what relations they specify as we meet them in the presentation. For 
example, the Order operator specifies a circumstantial (enhancing) relation, e.g., Subject 
precedes Finite. 



Result. The result of the application of Finite ' Subject is represented 
diagrammatically as follows: 




Figure 6: Finite ^ Subject - structural result in diagram 

In a box diagram, we usually cannot tell whether the sequence is significant, i.e. 
introduced by a sequencing statement like (Order Finite Subject), or not. 

Constraints. I will mention three constraints on the Order operator, all of which 
have to do with ordering and constituency status: 

1. The Order operator specifies actual sequence and not potential sequence; it 
does not specify a default sequence that could be overridden. 

2. Only constituents of the unit whose structure is being specified can be 
ordered in relation to one another. For example, the clause constituent 
Subject can be ordered with respect to the clause constituent Finite but not 
with respect to a Qualifier from a nominal group structure. 

3. Once a functional constituent such as Subject has been ordered, all the 
constituents of the unit realizing that function (Subject) have to obey the 
ordering specification. 

Although there is no inter-rank ordering, there is no constraint that disallows inter- 
layer ordering, i.e. ordering statements involving functions from different layers of 
structure. For example, there is no formal problem with an ordering statement such as 
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Subject * Medium with one mood function and one transitivity function. However, 
there has not been any reason to use ordering statements of this kind so far in Nigel's 
grammar of English. 



There is a mechanism for ordering other than the Order operator: default ordering 
lists. They list functions in the sequence in which they will appear unless another 
ordering has been specified explicitly by means of the Order operator. The first 
constraint says that the Order operator always specifies the actual sequence. In 
contrast, the lists specify potential (default) sequence. A default ordering list in Nigel is 
comparable to Fawcett's starting structure as an ordering statement. 

The next two constraints are appropriate most of the time, but there are certain 
situations where they ai 5 too strict, viz., theme and information prominence. These 
situations typically involve discontinuity; one case will be discussed next. 

Problem: discontinuity. Consider the example They called the meeting off from 
Halliday (1985), with the process call ... off. Halliday gives it the following transitivity 
structure in box diagram representation (the example is not covered by the grammar 
fragment in Figure 2). 

Although the box diagram looks different from those that have been used up to now, 
there is no problem in drawing it. In other words, the discontinuity can be represented 
in the box diagram. However, there is a problem for the formally interpreted 
realization operators described above. None of these operators can serve to specify the 
structure represented by this box diagram. 

For example, we cannot state Process * Medium * Process because that is 
contradictory, claiming that Process both precedes and follows Medium. Two 
approaches to this kind of problem have been used '^n the literature. 

1. Systemic grammarians have sometimes used a subscript to indicate two 
parts of a split function: Process^ * Medium * Processg where the first part 
is realized by called and the second part is realized by off. However, this 
device does not make explicit what the two parts are; the identification of 
them is left to the knowledgeable grammarian. 
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They called the meeting off 



Agent 


Pro- 


Medium 


-cess 




mate- 




-rial 



Figure 7: Transitivity structure of discontinuous example 

2. Alternatively, we can introduce a new realization operator. For example, 
Huddleston works with the notion of inclusion: one function can be 
included within another function. Thus, we would have (Include Medium 
Process), usually Process <Medium>. The problem with this strategy is 
the same as with the above-mentioned splitting strategy. If we only specify 
that one function (Medium) is to be included witliin another function 
(Process), this does not provide enough information to assign the first 
function a place within the second function. This problem does not 
invalidate the use of inclusion, of course, but it indicates that inclusion alone 
may not be enough, since more information is needed for an unambiguous 
specification of ordering. We could take inclusion to mean that the function 
included within another function is not given a unique ordering until the 
cycle through the grammar when the second function is realized. This 
method would need more communication between ranks than is presently 
used in Nigel.) 

In spite of the problem with the analysis of called ... off ss 3, discontinuous realization 
of Process, we do not want to give it up. The sequential discontinuity is matched by 
lexical continuity: 'call off is one lexical item; it is not semantically decomposable. 

I shall return to the problem of discontinuity after discussing the second dimension of 
organization, i.e., metafunctional layering, in Section 3.4. 
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3.3.3 Expand 

Characterization. The operator Expand is applied to one function to be expanded 
and one or more expanding functions, e.g., (Expand Mood Subject) or Mood (Subject), 
Mood (Subject, Finite), and so on. It creates a constituency relation between the 
expanded function and the expanding function(s). Thus, Mood (Subject) means that 
Subject is one of the constituents of the Mood constituent. 

When the Insert operat'^r is applied to a function, the function is inserted as a 
constituent of the unit being generated. (It does not have to be an immediate 
constituent.) For example, the statement +Subject inserts Subject as a clause 
constituent. The Expand operator is used when a function is not an immediate 
constituent of a unit (which it is by default) but rather an immediate constituent of a 
function in that unit. For example. Mood (Subject) specifies that Subject is an 
immediate constituent of Mood rather of the clause. (In principle, indefinitely long 
constituency chains could be created; in practice, there has rever been any need for a 
situation more complex that the one illustrated by Mood (Subject), where Mood is an 
immediate constituent of the clause and Subject is a constituent of the Mood 
constituent.) 

The Expand operator specifies a possessive (extending) relation of the part-whole kind, 
e.g., Subject is part of Mood. (To bring it up to date with Halliday's terms for the 
clause complex relations, we should perhaps call the operator Extend.) 

Result. Figure 8 diagrams the structural result of the application of Expand. 

Constraints. There are two constraints, the second of which anticipates another 
realization operator. Preselect, which is described in detail in Section 3.5: 

1. Expansion is actual expansion, not potential expansion. 

2. A function expanded by another function cannot be preselected for a 
feature. Only functions that are immediately realized by a grammatical 
class can be preselected. There is no mechanism for spreading or 
distributing preselected features from an expanded function lo its expanding 
functions. 
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Subject 


Finite 


Mood 



Figure 8: Mood (Subject, Finite) - structural result in diagram 

One example of the consequences of the second constraint will be mentioned in Section 
4.2. Expand is not used often. In Nigel, its main uses are in mood structure reflecting 
prosodic organization (see Section 4.2) and in theme structure reflecting thematic 
prominence (see Section 4.3). In the second case, it crosses metafunctional layering (see 
Section 3.4), as in Theme (Locative), where Theme is textual and Locative is 
experiential. 

3.4 Metafunctional layering 

Metafunctional layering is the dimension that defines the organ! 7?. Ion across layers of 
syntagmatic structuring such as transitivity structure, mood structure, and theme 
structure; it represents the reconciliation ol these various metafunctional contributions. 

Characterization. The realization operator used to specify metafunctional layering 
is Conflate. It is applied to two grammatical functions, e.g., (Conflate Subject Agent) 
or Subject/ Agent (sometimes the equal sign has been used. Subject = Agent). It means 
that they describe the same constituent. Conflate specifies an identifying relation: 
Subject is Agent; Agent is Subject. 

Result. The structural result of conflation is called a function bundle or fundle. It is 
represented by a column in a box diagram; see Figure 9 (There is no commonly used 
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tree diagram representation of layering, although there is a tree-like diagram in 
Relational Grammar where the so-called strata are comparable to metafunctional 
layers.) 



Subject 



Agent 



Figure 9: Subject / Agent - structural result in box diagram 

Constraints. I will discuss three constraints and their associated problems. 

1. Conflation is actual conflation, not potential conflation. Both functions in a 
conflation statement have to be inserted at some point during the same pass 
through the grammar as the conflation. If one or both functions do not get 
inserted, there is an error in the grammar. 

2. Only functions from the structural specifications of the same unit can be 
conflated; there is no inter-rank conflation. For example. Subject (from the 
mood structure of the clause) and Agent (from the transitivity structure of 
the clause) can be -^-^nflated but Subject and Minirange (from a prepositional 
phrase structure) cannot. 

3. The functions ^*iilated in a function bundle such as Subject/Agent always 
describe the same constituent. It is not possible to apply Conflate to 
functions and maintain different constituents for the different functions. 
This constraint is related to the discontinuity problem that I Introduced in 
Section 3.3; I will return to the problem later. 

Problem: implicit functions (continued from Section 3.3.1). What happens if 
a function that has been inserted, say Agent, gets conflated with a function that has not 
been inserted, say Subject? This situation was addressed above in the first constraint: 
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if Subject gets inserted af***' the conflation, there is no problem. As long as both 
furctions in a conflation statement get inserted at some point, the conflation is valid. 
However, if one of the two functions does not get inserted, this is considered a 
grammatical error. This is quite reasonable in most cases, but we have to consider how 
the constraint affects implicit functions, the problem introduced in Section 3.3.1. 

Aseume that the transitivity of a clause is such that the function Agent gets inserted 
(system AGENTIVITY: agentive in the grammar fragment above in Figure 2). \ssume 
further that the clause is operative (active; the VOICE system), resulting in a conflation 
of Agent and Subject: Agent/Subject. Now, if mood choices lead to an explicit 
Subject, there is no problem: Subject will be inserted a*^ was Agent. But if mood 
choices lead to an implicit Subject (system DECLARATIVE SUBJECT 
PRESUMPTION: implicit ), there is a problem: Agent \r> inserted but Subject is 
declared to be implicit and hence absent ("*Subject" in the grammar fragment above), 
resulting in a conflation between an inserted function and an absent function. How can 
the presence o^ Agent be blocked? 

Solutions, (i) One solution is simply to add conditions for the insertion of Agent so 
that the function does not get inserted unless Subject is explicit in an operative clause. 
This approach is used in Nigel and poses no formal problems. There Is a gate, a 
"system" with only one output feature, for inserting Agent A""th a complex entry 
condition (a conjunction of the features agentive and explicit subject, f^on the sample 
grammar and the gate for inserting Subject, respectively). The complexity o{ the entry 
condition is a drawback, of course. It would arise for each function that may be 
conflated with Subject. 

(ii) Another solution is to rely on the distinction between potential presence and actual 
presence discussed in Section 3.3.1. When the statement +Agent is applied, the function 
Agent is potentially present. If it gets conflated with another function. Subject, that 
has been inserted as potentially present, both functions will be actually present unless 
there is a statement to the contrary. There conld be a preventive statement, specifying 
the absence of Subject (as in the system DECLARATIVE SUBJECT PRESUMPTION 
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in the sample grammar): *Subject, which would cause the conflation Agent/*Subject to 
be absent (implicit) as a whole. Or alternatively, Subject may never get inserted, la 
which case a convention would prevent the potential presence of Agent from becoming 
actual presence. 

These conventions are not ad hoc, but functionally motivated: The presence of an 
element of structure or a bundle of functions depends on more than one metafunction; 
it is a layered object and unless all the layers are present, the object itself won't be 
present. 

Problem: conflation outside unit. The second constraint on conflation is usually 
desirable. In fact, it reflects an important aspect of the functional organization of a 
grammar: the units of the grammar are units because they unite particular functional 
contributions, e.g., mood, transitivity and theme for the clause. However, there are 
sone cases, such as the Minirange of a prepositional phrase realizing a locative 
circumstance of a clause (This bed has not been slept in), where the constraint is 
problematic. 

Problem: discontinuity (continued from Section 3.3.2). I left the example 
They called the meeting off without a solution in Section 3.3.2. I noted that it 
exemplifies both continuity and discontinuity in the process call ... off. Lexically, it is 
continuous; that is, it is one lexical item. In terms of sequence, it is discontinuous. 

Towards a solution of the discontinuity problem. Halliday (1985) provider a 
solution to the problem of representing examples with di- ontinuously realized phrasal 
verbs. It relies on the metafunctional layering, so it could not be stated only in terms of 
one dimension of structuring (Section 3.3). 

Halliday's account involves both transitivity structure and mood structure. The two 
interpretations for They called the meeting off are: 

1. Transitivity: one Process (of class 'materiar) call off. The lexical 
specification can be stated in terms of Process: Process: 'material'. (In 
other words, Process is preselected for the lexical feature 'material'.) 
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2. Mood: one Predicator call and one Adjunct off. The sequence can be 
stated in terms of Predicator and Adjunct: Predicator " Complement " 
Adjunct. 

Each interpretation alone captures only part of the facts, but we get the full picture, 
when they are brought together. The mood structure is also used to state the 
inflectional form of the verb. The function Finite is constrained to be 'past', and since 
the clause is declarative and simple past. Finite is conflated with Predicator. The box 
diagram for the example is given in Figure 10 (see Halliday (1985)). 



They 


called 


the meeting 
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Agent 


Pro- 


Medium 


-cess 




mate- 




-rial 


Subject 


Finite 

past 
Predicator 


Complement 


Adjunct 



Figure 10: Two layers of structuring for discontinuous example 

Halliday's analysis represented in the box diagram avoids the earlier problems 
discussed in Section 3.3.2: since there is no discontinuity in the sequence Predicator " 
Complement " Adjunct, there is no need to design a new inclusion operator and there is 
no need to split the Process. (The problems with these two solutions were discussed 
above.) Since we retain an unsplit Process function, there is no problem in constraining 
it to be one lexical item. Finally, since Predicator and Adjunct are kept distinct, there 
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is no problem in conflating Finite and Predicator, thus constraining that part of the 
phrasal verb to be 'past' in our example (i.e. caiitd off rather than call of fed). 

There 5s a problem, however, when we move from the box diagram representation to 
look for a more formally interpretable realization statement. The problem is the 
Conflate operator itself. If we conflate Process and Predicator and then also Process 
and Adjunct to get the column alignment in the box diagram, the result is just one 
constituent, Process/Predicator/Adjunct. Instead of this conflation, what we need to do 
is to conflate Predicator and Adjunct with Process without conflating Predicator and 
Adjunct with one another. We need to be able to state (Predicator & Adjunct)/Process. 

Although .his kind of conflation does not exist as a Nigel realization operator now, 
something like it is supported in Functional Unification Grammar: Kasper's paper in 
this report presents the solution in those terms. 

3.5 Rank 

The third dimension of organization to be discussed is rank. Structuring and layering 
combine to form the function structure of one unit such as the clause. Rank organizes 
the gram nar into a scale of units: clause - group/phrase - word - morpheme. 

Characterization. The realization operator that handles communication betw.^en 
ranks is Preselect. The Preselect operator is applied to a function and a feature. For 
example, Subject nay be preselected to be singular: (Preselect Subject singular) or 
Subject: singular. This means that the feature singular will be chosen (in the 
f^njMBER system) v/hen the grammar is re-entered to realize the Subject function 
bundle. 

Result. The result of the application of the operator Preselect is like a two-cell 
tagmeme with a slot (the function) and a filler (the feature). It is represented simply by 
listing the preselected features under the function they have been associated with. For 
example. Subject: singular is recorded in the following way; see Figure 11. 

Constraints. I will mention two constraints. The first is a familiar one, having to 
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Subject 
singular 



Figure 11: Subject: singular - structural result in box diagram 
do with the actuality of the result of the application of the operator. The second makes 
reference to the rank scale. 

1. Preselection is actual preselection. Once a feature has been associated with a 
function through preselection it has to be chosen when the function is 
realized and the system network is traversed. This constraint can be a bit 
awkward, since it would sometimes be useful to be able to state that a 
feature is to be selected if its system is entered without forcing the system to 
be entered. For example, it would be useful to be able to state Subject: 
nominative for all Subjects, although the feature nominLtive will only 
actually be chosen if the Subject is pronominal. 

2. In principle, the operator works downwards in terms of the rank scale, from 
clause to group/phrase, from group/phrase to word, and from word to 
morpheme. There may also be rankshift, e.g., preselection from clause to 
clause as group/phrase or from group to clause as word. Preselection cannot 
be stated within the same rank unless there is rankshift. For example, it is 
not possible to use Preselect to state a preselection relation between a 
nominal group and a verbal group when both realize clause functions such as 
Subject and Finite/Predicator. The constraint can also be phrased in terms 
of constituency. Preselection works downwards in constituency from a whole 
to one of its parts; it does not work across from one part to another. The 
control of selection exercised by Preselect is top-down control. 

For example, if we want to use Preselect to ensure agreement between Subject and 
Finite, we have to set up clause systems for SUBJECT PERSON (or perhaps more 
2,ccurately MOOD PERSON as in the sample grammar in Figure 2, cf. Section 4.2). If 
the grammar contains a s/stem for INTERACTANT SUBJECT where one of the 
features is addressee subject, we can associate the two realization statements "Subject: 
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second person" and "Finite: second person" to ensure agreemen in examples such as 
you are. This strategy works up to a point, but there are problems, which are discussed 
in the following illustration of representational problems. 



Problem: agreement. As we have seen, we can achieve agreement by preselecting 
both Subject and Finite. When Subject is preselected to be plural, ? e.. Subject: plural, 
this feature can be instantiated in the nominal group in at least these four different 
ways. We have for example: 

additive plurality: The title and the purport are ... 
plurality in disjunction: The title or subtitles are ... 
multiplicative plurality: The titles are ... 
collective plurality: The crew are ... 

The main problem is one of anticipation. Although these examples are all plural, the 
nominal groups are grammatically different. For example, the first is plural because the 
nominal group is an additive complex while the third is a plural simple nominal group. 
From a grammatical point of view, it is thus hard to anticipate how plurality will be 
achieved in the nominal group. 

It is important to emphasize that the "agreement" between Subject and Finite is a 
clause phenomenon. Its existence and characteristics are interpretable in terms of the 
mood organization of the clause. Mood (Subject, Finite) - cf. Section 4.2. However, the 
fact that its existence is a clause phenomenon does not entail that the various person 
and number values necessarily have to be anticipated in the clause. 

Towards a solution of the agreement problem. Instead of trying to anticipate 
the different kinds of nominal group plurality in the clause, it seems preferable to adopt 
a solution like the following: 
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1. Specify in a realization statement In the clause that Subject and Finite are 
tht same in person anH inimber, but do not anticipate the various T^ossible 
values. 

2. Let the values be determined across constituents so that the nominal group 
choices of person and number values (Subject) determine verbal group 
choices (Finite). 

This possible solution seems like an obvious approach. The point here is that it 
requires new "realization machinery". For the first step, we need to be able to state the 
identity, say: 

Finite: [PERSON = PERSON (Subject)] 

which means that the PERSON system in the verbal group realizing Finite has the 
same value as the PERSON system in the nominal group realizing Subject. For the 
second step, we need to allow the choice of a feature in one system to be made 
according to the choice of a feature in another system. In other words, we have to 
allow the method of choosing — not the particular feature value — to be predetermined 
in this case. 

This procedure does not exist in Nigel at present. However, something similar is 
supported by Functional Unification Grammar; see Kasper's paper in this report. 

Clearly, there are other possible solutions to the "agreement problem". For instance, 
instead of having the grammar ensure agreement through some kind of realization 
statement, we can rely on semantic/conceptual consistency and make the person- 
number choices twice — once for the Subject nominal group and once for the Finite 
constituent. Agreement will then obtain simply because the two sets of choices will be 
guided by the same information. For example, the referent of the Subject of King 
Arthur and his knights are valiant, 'King Arthur and his knights', will determine both 
the person and number of the nominal group complex King Arthur and his knights 
and the form of the finite verb are. This is the kind of approach to "agreement" used 
in Nigel at present. 
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Whatever solution is adopted to solve the agreement problem, there is a high value to 
the kind of approach to inter-rank communication supported by FUG and discussed by 
Kasper. One reason for this is that agreement is one specific instance of a more general 
phenomenon, viz., a different kind of structuring (prosodic structuring) that is difficult 
to accommodate with the present set of (Nigel's) realization operators. 

3.6 Summary of constraints on structuring operators 

The following is a review of the most important restrictions on the realization 
operators Insert, Expand, and Order: 

1. Actual: The application of an operator leads to an actual result, not a 
potential one. There is no way of stating 'potentially 
inserted/ordered/expanded but not actually [in this case]'. 

2. Domain: The operators only operate within the iomain of the structure of 
one unit (such as clause structure, nominal group structure, etc.) and within 
the rank of that unit (clause rank, group rank, etc.). 

3. Compact functions: Furthermore, the operators operate on functions as 
discrete objects, without presupposing any internal organization of these 
functions; there is no overlap, fusion, or partial merger of functions. In 
particular, there is no realization operator "Include" for inclusion 
statements. 

4. Complex operands: Except for the operators Expand and Preselect, the 
realization operators do not allow complex operands. For example, only 
Process / Pr<:clicator is a possible realization statement. Process / (Predicator 
& Adjunct) s not. 

Up to now, I have only examined a few representational problems, viz., implicit 
functions, discontinuity, and agreement, and I will now turn to a more general 
characterization of representational problems. 

4 MODES OF MEA:NING AND MODES OF STRUCTURE: 
CHARACTERIZATION OF REPRESENTATIONAL PROBLEMS 

Pike (1959) and Halliday (1979) have pointed to the existence of different modes of 
linguistic organization. Pike suggests that we can look at a linguistic phenomenon as 
field, particle or wave. These diiferent perspectives are complementary and bring out 
different properties. ^ ^ 
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Halliday's pent is different from Pike's but ultimately related (see Halliday 1979). He 
identifies four modes of meaning, and suggests that four different modes of grammatical 
organization correspond to these four different semantic modes. The modes correlate 
with the different metafunctions: 
experiential — constituency 

interpersonal — prosody 

textual — pulse 

logical — interdependency (chain) 

These modes represent four different principles of structuring, i.e. prosodic 
structuring, and so on. In a function structure, two or more modes may exist in parallel 
through metafunctional layering. This happens in the clause, which is characterized by 
layers of transitivity constituency, mood prosody, and theme pulse. 

The tools used in representing grammatical structure are well suited only to the 
constituency mode of organization, and this gives us a basis for looking at 
representational problems as related in particular to the other modes. 

4.1 Constituency 

I will start with the mode of organizatior our grammar is good at, viz., constituency. 
Constituency is the grammar's mode for organizing experiential meaning: our 
experience is organized grammatically in constituency terms. Foi example, a process 
composite/configuration is decomposed into constituent parts such as the process itself, 
the participants involved in it, and the attendant cirumstances (transitivity). A 
rarticipant is usually an object, and is further decomposed grammatically as a thing 
and classifiers, epithets, and qualifiers (modification). 

Our common notion of grammatical structure fits this mode of organization quite well; 
we speak of grammatical constituents and draw tree diagrams. More specifically, the 
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realization operators discussed in Section 3 are constituency oriented. The structuring 
operators (Insert, Expand, and Order) create constituency, while the layering and rank 
operators rely on constituency. 

1. Insert specifies the presence of a function as a constituent of a grammatical 
unit. 

2. Expand explicitly creates a constituency relation between two functions. 

3. Order operates on functionally defined constituents (of the same unit). 

4. Conflate creates an identity relation between two constituent functions; the 
result is a constituent function bundle. 

5. Preselect does not create constituency itself, since it specifies a constraint on 
how to realize a function, but it presupposes the re-expression of a 
constituent function of one unit (e.g., the Agent of a clause) as the structure 
of another unit (e.g.. Epithet " Thing of a nominal group). In other words, 
preselection is in terms of the overall constituency organization. 

The functions themselves are (labels of) constituents. For example, (i) they are 
segmental in nature; no function can stretch over more than one segment unless it 
stands in a constituency relation to other functions (through the application of the 
Expand operator), (ii) They are clearly bounded segments of a categorical nature; there 
is no gradience (cline). 

The use of the operators is summarized diagrammatically in Figure 12 to bring out 
their creation of, or reliance on, constituency. 

Each constituent in an experiential layer of structure has a unique value in relation to 
the whole; the organization is multivariate. For example, a transitivity structure is 
organized into functions such as Senser Process Phenomenon Manner Locative. Since 
constituency is the experiential mode of structuring, experiential re Oization statements 
are usually well accommodated by multivariate constituency organization. For example, 
if a mental clause is metaphenomenal, there is a preselection stated in terms of one of 
the multivariate functions, the Phenomenon: "Phenomenon: noun-clause". If a clause 
is an unmarked intensive ascriptive one, the Process is constrained to be 'be' and the 
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structuring 




Conflate 
Agent 

■ 

rank 

Preselect 

m 
m 
• 
I 

nominal group 



Figure 12: Realization operators and constituency 
Attribute is a group with either a nominal or an adjectival head. Default orderings are 
also stated in terms of the multivariate constituent values. For example, we can state 
(Process Manner Locative), which means 'Process precedes Manner whic' precedes 
Locative'. 

In contrast, realization statements other than those deriving from the experiential 
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metafunction are not as easily accomodated by multivariate constituerivy. For example, 
logical realizations are stated more naturally in terms of (univariate) interdependency 
than in terms of constituency: see Section 4.4 below. 

The observation that the experiential mode of organization is (multivariate) 
constituency does not mean that there are no problematic cases. Linguists have 
sometimes felt a need to interpret structures as f isions/mergers and to leave constituent 
status indeterminate; such interpretations are outside the scope of Nigel's present set of 
realization operators. 

4.2 Prosody 

Agreement is an example of a mode of organization different from constituency; I 
would suggest it is what Halliday calls prosodic. It is the organization of grammatical 
structure, not as constituent segments, but as prosodies. If it is stated in terms of 
constituency, it typically has to be characterized as running across more than one 
constituent just as e.g., nasality may run across several phonological segments. 
Agreement is a case in point. It is, as already suggested, a mood characteristic rather 
than a Subject property or a Finite property per se, since it runs across these two 
constituents. 

Prosodic organization includes; 

mood number & person (agreement) 

moodtag 

polarity 

reflexivization 

key 



Moodtag. Mood number & person and moodtag are closely related. If a moodtag is 
present, as in You are coming out tonight^ aren't you?, the person and number 
selections run through both the mood and the moodtag. This prosody is represented 
diagrammatically in Figure 13. 
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You are convng out tonight, aren't you? 



Figure 13: Mood and moodtag - number & person prosody 

The existing realization operators cannot achieve more formally what is represented 
diagrammatically in Figure 13. Features can only be manipulated as segments 
associated with constituent functions; they cannot be introduced as prosodies. 

Polarity. Polarity is similar: it is interpersonal in that it represents the speaker's 
denial or affirmation, agreement or disagreement, command or prohibition, etc. with 
respect to a proposition/proposal (cf. also Givon (1979: chapter on Negation)). 
Structurally, it is often not locatable in any one particular place. In "non-standard" 
English it is often realized wherever possible; the particular transitivity value does not 
matter: 

/ ain^ never had no trouble xuith none of 'em 
(from Labov (1970) 

"Standard" English is of course quite similar in this respect. The difference is that 
the pattern is not not ... never no none as in the example above but rather not ... ever 



ERLC 



AO 

34 



any any with so-called non-assertive polarity items following not. The prosodic 
character remains; the "non- assertive" forms are typically used wherever possible. 
Diagrammatically, the polarity prosody can be represented in the same way as the mood 
person & number prosody; see Figure 14. 



am t never had no trouble with none of 'em 

haven't ever had any trouble with any of them 



Figure 14: Polarity prosody 

Just as before, there is no formal representation of the prosody, no realization 
operator that can be used. And just as before, this is to be expected sirxce the 
realization operators are constituency-oriented rather than prosody-oriented. 

Reflexivization. Reflexivization also seems to work as a prosody. Whenever 
appropriate, a constituent following the Subject of a clause is marked for identity of 
reference: 

Henry talked about Mmself to himself often in those days of intense 
loneliness 

The particular value in the transitivity structure of the constituent reflexivized (Goal, 
Actor, Recipient, etc.) does not matter; what matters is whether the reflexive prosody 
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runs through it or not, and whether it can expound the reflexive prosody or not. Again, 
the formal problems In a treatment of reflexivization can be predicted from its prosodic 
nature. There is no easy way to use the Preselect operator to achieve reflexivization 
wherever appropriate. 

Key. Key is perhaps more obviously realized prosodically than the other regions since 
it is realized by tone; cf. for example Halliday (1979). 

Representation of prosody by constituency: Expand. The representational 
problems with prosodic organization are brought out when we try to capture the 
prosodies in constituency terms. Halliday has introduced a constituency interpretation 
of the clause as an interpersonal unit (the clause as exchange/interaction). This is the 
Mood ' Residue structuring. Mood consists of Subject and Finite and Residue is, in 
principle, the rest (Predicator, Complements, and Adjuncts). These constituents can be 
used in stating the domains of the interpersonal prosodies, the domains of person & 
number, polarity, etc. This possibility is illustrated in the two diagrams of Figure 
Figure 15, first for the example from Figure 14 and then for an example also involving 
reflexivization. 

The features 'speaker', 'reflexive', and 'non-assertive' have been associated with the 
Mood and Residue fun.-ions, not with the fuactions that are directly realized by groups 
(Adjunct, Complement, etc.), to indicate their domains in constituency terms. (I use 
the feature 'non-assertive' in the way it is used by Quirk et al. I intend it to be taken 
only as a short-hand approximation.) 

This approach could be a constituency solution to the problem of representing 
grammatical prosodies. However, there is no formal correlate of the diagram cast in 
terms of realization operators. Presently, the Preselect operator can only be used to 
associate a feature with a grammatical function that is directly realized by the 
grammar. Both Mood and Residue are further expanded rather than directly realized. 
Furthermore, the diagram presupposes that a knowledgeable procedure of feature 
Ustribution will be activavw where appropriate. For example, the feature 'non- 
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(a 



) / 


haven't 


ever 


had 


any trouble with any of them 


Subj. 


Finite: 
negative 


Adjunct 


Pred. 


Complement 


Mood 


Residue: 

non-assertive 



haven't ever asked myself any hard questions 



Subj. 


Finite: 
negative 


Adjunct 


Pred. 


Complement 


Complement 


Mood: 
speaker 


Residue: 

reflexive, non-assertive 



Figure 15: Prosodic organization and constituency- 
assertive' should only be applied to adverbial and nominal groups that realize functions 
expanding Residue. The feature 'reflexive' would not behave like other features. It 
really means something like "check to see whether the nominal group is coreferential 
with the Subject; if it is, choose 'reflexive' in that group". And just as with the feature 
'non-assertive', there is the ib^.e of appropriate applicability. 
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4.3 Puise 

Halliday (1985) argues that the discontinuity in Thty ca'ded the meeting ©//arises for 
textual I ;asons. In information structure, the function Mew is assigned to the process of 
the clause in an unmarlced way, which means that it should be the last open-class 
lexical item of the clause. If there is a complement, this can be achieved by "splitting" 
a phrasal verb, thereby creating discontinuity. The assignment of New is one of two 
kinds of textual prominence that affect the clause. It is a culminative kind of 
prominence (cf. Halliday (1979; 1985)), while the other is tnematic prominence. Both 
may lead to representational problems: 

Culminative prominence: split Subject 

split Complement 
-sp/if Process 

Thematic prominence: thematic Minor complement (in p;ep. phrase) 

thematic participant from projected clause 
thematic Qualifier of symh; lie Thing 



Culminative ninence. The representational problem with the first ^ype, 
culminative prominence, is essentially associated with discontinuity: a mood or 
transitivity constituent is split into two parts, one of which appears later than the 
other, presumably as unmarked New. (Interestingly, the part thac is sequentially 
"delayed" in this way can usually be interpreted as a ;9 in a logical structure.) The 
problem was illustrated above with a split Process. Here are some examples of the other 
types: 

. ..nvone is a fool who tries to ^at an alligator 

I'll give anyone ten dollars who can help me. find my cat 

Questions remained about less fortunate enemies of the Marcos 
regime (Time) 
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The attempt was made to find a practical way to solve the problem 
without going into its theoretical depths {Tillich) 

I call a person bad who lies aiid cheats and is unkind (Maugham) 

{Some of these examples may involve grammatical metaphor. For example, the last 
example can perhaps be related to / call a person bad \\ if he lies and cheats and is 
unkind.) 

Thematic prominence. The second type, thematic prominence, may lead to 
problems quite similar to those of the first type: an element is a constituent of one unit 
under one interpretation {usually transitivity or modification structure) and a 
constituent of another unit under a textual interpretation, more specifically as Theme. 
For example, this house is the Minirange of the prepositional phrase realizing the 
circumstance Locative {a transitivity function in the clause) and it is the Theme of the 
clause in which it appears initially: 

This house we lived in for ten years 

The constituency problem is diagrammed in Figure 16 {the@sign indicates the non- 
thematic position of the Minirange). 

Notice that the constituency problem of the example is not an experiential one. 
Rather, the conflict arises as the two metafunctions, the textual and the experiential, 
are mapped onto one another through Theme selection. Textually, this house is a 
constituent of the clause, but experientially, it is a constituent of the prepositional 
phrase. {This kind of structure is very common in wh-clauses, of course, where the wh- 
item is the unmarked Theme: Which house did you live in?) 

Why does the textual metafunction not "respect" the constituency organization 
created by the experiential metafunction? There are at least two related answers. One 
answer is that we simply have two different constituent structurings, the textual 
{Theme-Rheme) and the experiential {transitivity and minor transitivity); and that 
although they are normally in phase with one another, there are situations where the 

ERslC 39 



clause: 
prep, phr.: 


Theme 


Rheme 






Locative 




@Minirc>nge 




Minirange @ 






This bouse 


we lived 


in 


for ten years 



Figure 16: Theme and marked constituency 
two structuring principles are out of phase with one another. This answer goes some 
way towards describing the situation in a princ'ipled way and it certainly seems better 
than merely observing that a constituent has been dislocated. 

The second answer goes beyond distinguishing two distinct constituency principles. It 
says that the textual principle of structuring is not really a constituency one, but has a 
different character. I will call this kind of structure a pulse, because textual prominence 
can be thought of as a pulse thac ripples through the experiential structure. Halliday 
calls this kmd of structure periodic. The notion can be compared to Pike's wave. 

There are at least three important reasons for the pulse or WLi-ve interpretation: 

1. The pulse (wave) is periodic, with phases of prominence and phases of non- 
prominence. (There are in fact two pulses: the thematic pulse and the 
information pulse.) 

2. The pulse (wave) is characterized by gradual transitions rather than sharp 
constituency boundaries. 
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3. The pulse (wave) is dynamic, running across the constitueacy hierarchy. 

The third point in particular explains why textual structuring does not respect the 
hierarchic organization created by the experiential metafunction. Note that the theme 
pulse characterizes only clauses, not groups and phrases. Or rather, the pulse is built 
into the nominal and verbal group structures realized sequentially with a deictic-finite 
starting point (see Halliday (1985)), but there is no choice. The textual metafunction 
does not generate a constituency hierarchy of themes in the way the experiential 
metafunction generates a constituency hierarchy of phenomena decomposed into 
processes, participants, and circumstances, and then e.g , participants in their turn 
decomposed into things, epithets, and so on. 

To represent textual structure graphically, Halliday uses a diminuendo-crescendo 
diagram. Consider the following example with multiple themes: 

In two or more years, both in London to which he went for the last part of 
the season and to pay a round of country house visits in the early autumn, 
and in Paris, where he had settled down, he knew everyone whom a young 
American could know. 

(Maugham) 

The multiple themes can be represented graphically as follows (skipping part of the 
example to accomoaate it in the figure); see Figure 17. 

Representation of pulse by constituency: Expand. The thematic pulse affects 
interpersonal and experiential functions and it can be represented as if it were a 
constituent function expanded by the functions it affects. We have to treat the non- 
discrete pulse as if it were a discre^.e constituent Theme, finding a thematic boundary 
somewhere (see Halliday (1985: chapter 3)). In addition, we have to give up on 
representing the dynamic nature of the pulse and its disregard for experiential 
constituency. The expand operator can be used. Theme (Place) and Theme (Time), 
giving the following structure; see Figure 18. 



47 

41 



In two years or more, both in London ... and in Paris ... he knew .. 



Time 


Place 


Senser 


Process 


Theme 

















Figure 17: Thematic diminuendo part of the textua' pulse -- graphic 

representation 



In two years or more, both in London ... and in Paris ... he knew ... 



Time 


Place 


Senser 


Process 


Theme 





Figure 18: Representation of thematic pulse by constituency 



ERIC 



42 48 



4.4 Interdependency (chain) 

The logical metaf unction generates interdependency or chain structures; for a general 
discussion, see Martin (1986). When a logical element of structure is inserted, it is 
inserted as interdependent on the previous logical element and not as a constituent of 
some whole. The structure is thus like a chain. Each new link is defined in relation to 
the previous link. In principle, this is the structuring that characterizes complexes of 
units rather than simplex units: Tom, Dicky and Harry as opposed to Tom; I went to 
Jot's Place and had a suak as opposed to / went to Jot's Place. In particular, clause 
complexes and prepositional phrase complexes have interdependency structures, but 
clauses and prepositional phrases do not. Groups can, however, be interpreted as 
having interdependency structures (see Halliday (1985: ch. 6)). The structure of the 
nominal group complex Tom, Dick, and Harry can be diagrammed as follows in Figure 
19. 



Tom, Dick, and Harry 



1 2 ► 3 



Figure 19: Graphic representation of interdependency structure 

The functions in the interdependency structure do not have unique constituency 
values. Instead, the functions are values in a logical ordering such as 1 2 3 4; the value 
of each function is determined by its logical ordering. Tom is 1 because it is the first 
coordinate, Dick is 2 because it is the second coordinate, and Harry is 3 because it is 
the third coordinate. 

The general representational problem with interdependency structures is that there are 
O 43 49 
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currently no realization operators that can be used to specify them. In Figure 1, I used 
X -> y to mean 'insert y as interdependent on x\ but there is no formal interpretation 
of it. The Insert operator discussed in Section 3.3.1 cannot be used to insert a logical 
function, since it inserts a function as a constituent of some unit and not as 
interdependent on another function. Examples of interdependency structures include 

Clause complexes: clause coordination, etc. 

Nominal group complexes, etc. 

Modification in nominal groups 

Tense in verbal groups. 



As already mentioned in Section 4.1, realization statements deriving from the logical 
mctafuncticn are easier to state in terms of interdependency than in terms of 
constituency. Experiential realization statements make reference to (multivariate) 
functions with unique values in relation to the whole. Logical realization statements 
make reference to (univariate) functions whose values reflect their logical ordering in 
interdependency. I will discuss clause tense and coordination only. 

Tense. As an example of tense structure, I will take the verbal group 7/ have been 
going to be being tested from Halliday's example below (Halliday (1976: 145)): 
Can I use that machine when I come in at this time tomorrow? 

No — it's going to be being tested. 

It'll have been going to be being tested every day for a fortnight soon. 

The logical structure given to the verbal group includes tense and voice auxiliaries as 
follows in Figure 20. (The structure is a dependency structure and the letters of the 
Greek alphabet are used to represent the logical functions; the structure is Figure 19 is 
a paratactic one. The difference is discussed in Halliday (1985: Chapter 7),) 

Realization statements have to specify the sequence of the functions of the verbal 
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7/ 



have 



been going 



to be 



being 



tested 



a 



^ r 



Figure 20: Logical interdependency structure of verbal group 



group and they have to specify the inflectional forms. Both specifications are stated 
most naturally in terms of the dependency ordering, (i) A dependent function follows 
the function it is dependent on: ^ follows alpha, and so on. (ii) The inflectional form of 
the verb realizing a dependent function is determined by the realization of the function 
it is dependent on: have (^) determines the past participial form of been going to (7). 
(For a thorough discussion of dependency and auxiliary order, see Schachter (1981).) 

Clause coordination. Coordination is similar to tense in respect to what realization 
statemcii.^ ?d to refer to. The sequence of coordinated clauses reflects the order in 
which they are coordinated. Furthermore, structural ellipsis is determined by 
interdependency relations. For example. Subject ellipsis works forwards in the 
interdependency structure (progressive ellipsis) while Complement ellipsis works 
Lackwards (regressive ellipsis). Both kinds of ellipsis make reference to direction in the 
interdependency structure, not to constituency. Furthermore, progressive ellipsis of 
Subject, for example, can continue for as long as the interdependency chain continues. 

It is also worth noting that structural markers such as and and or are specified by 
reference to the ordering of the chain, before the last link. In this respect, they contrast 
with both case marking and marking by preposition. 
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5 MODES OF MEANING AND MODES OF CHOICE 

The main point of the previous section is that different kinds of meaning (experiential, 
interpersonal, textual, and logical) are associated with different kinds of structure 
(constituence, prosody, pulse, and interdependence). This enables us to characterize the 
representational problems pertaining to realization operators in a principled way: the 
operators are constituency-oriented, and other kinds of structure will cause various 
types of problems for them. Although we can represent prosodies, pulses, and 
interdependencies diagrammatically, there are at present no realization operators that 
can introduce and operate on prosodies, pulses, and interdependencies. For example, 
insertion means insertion as a constituent in the structure of a unit; there is no insertion 
operator for interdependents (no 'insert x as interdependent on y'). 

The presentation so far has focused on problems in structural representation. 
However, I think that the different modes of meaning are reflected not only in the 
syntagmatic organization (defined by the realization operators) but also in the 
paradigmatic organization, i.e., in the network of systems. 

The experiential mode of meaning leads to multivariate constituency structures. The 
systemic organization is similar: systems have unique mv uivariate feature values and 
are ordered hierarchically in delicacy. The other metafunctions raise representational 
issues. This is t;rue particularly of logical and textual systems. I will look at them in 
that order. (I will not discuss interpersonal systems and the representation of 
interpersonal clines and prosodic features.) 

5.1 Logical: recursive systems 

Logical systems have long been recognized as having special status. The 
representational problem that arises from this special status was mentioned in Section 2 
and I will return to it now. Logical systems are assumed to be (linearly) recursive. The 
recursion is represented by a loop back to the entry condition in the system, as in 
Figure L 

Problem, systemic recursion. Henrici's (1966) diagnosis of the problem with 
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recursive systems has already been quoted (Section 2). They cannot be represented 
formally in the same way as "conventional systems". The output of the system also 
provides an input to it (the loop in FigUi*e l). Systemic features are distinguished by 
their names, and since the feature complex in the system in Figure 1 can be chosen 
more than once, there is a naming problem. It is not clear how the feature chosen the 
first time is distinct from the feature when io is chosen the second time, and so on. 
This is related to Henrici's remarks about partial ordering. Moreover, how is a 
realization statement to be applied recursively when a system is entered more than 
once? 

Towards a solution. One approach towards a solution is in terms of the type-token 
distinction. We can think of a system as a system type. Each time it is entered a 
system is instantiated as a token of the type. For conventional systems, there would 
only be one token per pass through the grammar, but recursive systems would allow 
multiple tokens. These tokens would have to be ordered by a counter so that each 
token has a unique place in the ordering: token 1, token 2, token 3, and so on. (Note 
that this approach appeals to the generation algorithm for traversing the system 
network. If we see the logical resources as essentially dynamic rather than static, the 
appeal to the generation algorithm follows naturally. For the distinction between 
dynamic and static systems, cf. Martin (1985).) 

In this approach, the difference between multivariate and univariate is the difference 
between instantiations of multiple types and multiple instantiations of the same type. 
For example, in the multivariate structure Actor Proce *^oal Recipient (I gave the 
report U enry), the participants are inserted by realization statements associated with 
three differr t systems. In the univariate structure 12 3 [and] 4 (Torrid Dicic, Harry, 
and Henry), the coordinates are inserted by the same system in successive instantiations 
of it. 



ERLC 



53 

47 



5.2 Textual: variable/dynamic systems 

Textual structure can be characterized as a pulse or a wave. Is there a systemic 
correlate, a system pulse comparable to the structure pulse? Arguably there is, if the 
pulse is interpreted as dynamic/variable in nature: a possible correlate is a 
dynamic/variable system. 

As an example, consider theme selection. The issue here is how to represent the 
resources of thematization. What are the thematization options? For example, what 
are the experiential theme options from tLe transitivity of a clause? The answer 
depends on transitivity choices (e.g., is the clause middle or effective, benefactive or 
non-benefactive, ranged or non-ranged?); on related circumstantial choices (e.g., is there 
a Locative, a Cause, a Manner, etc.?); and also on voice choices (e.g., is the Agent or tiie 
Medium the Subject and thus the unmarked Theme candidate?). 

The theme potential varies with the transitivity choices. It can be characterized, at 
least informally, as a (set of) dynamic/variable system(s). For example, in a middle 
(non-ranged) clause, there is only one participant Theme candidate, the Medium; in an 
effective (agentive) clause, there are two, the Medium and the Agent; and in a 
benefactive clause, there are three, the Medium, the Agent, and the Beneficiary. 
Similarly, there are as many circumstantial Theme candidates in a clause as there are 
circumstances. 

The notion of a dynamic or variable system is merely a way of thinking about a 
problem like theme selection; it is not a solution. It raises the issue of process-oriented 
accounts mentioned in Section 1. It does not seem unreasonable that some systems are 
inherently dynamic rather than static, particularly systems deriving from the textual 
metafunctic i.e., the enabling metafunction concerned with the process of meaning 
itself. 
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6 CONCLUSION: LOOKING FOR SOLUTIONS 

I have discussed various problems of representation and mentioned a few possible 
solutions. I have tried to diagnose the problems in terms of the four different 
metafunctional modes of organization, suggesting ^^peciH-^ problems associated with each 
of the non-constituency modes. 

There are, of course, problems in lexicogrammatical representation not discussed in 
this paper. They include: theme predication, ellipsis; grammatical metaphor, play on 
the system; and the accomodation of ;ertain lexical information (e.g., collocation, and 
lexical funr ions) lexical items not corresponding to grammatical constituents, lexical 
iter-s at ranks above word rank, and so on. These problems are not caused simply by 
non-oonstituency modes of organization, as are the problems I have focused on in the 
paper, but they can also be explained in principled ways. For example, one significant 
set of problems is caused by re-representation within the linguistic system itself. The 
set includes textual organization re-represento^l identifying relational transitivity 
organization (theme predication and theme nominalization), as well as grammatical 
metaphor in general, where e.g., one transitivity structure is re-represented by another 
transitivity structure (see Halliday (1985: Chapter 10)). 

Where can we find solutions? If we take natural language as a model for our 
metalanguage, our "language" of representation, two of the methods for expanding the 
p .ential that suggest themselves are metaphor and borrowing. 

(i) Metaphor. The prevailing metaphor is in a sense the source of the pr-^blems 
discussed in this paper. It is the constituency metaphor: treat all modes of structuring 
as if they were constituency. For example, as I showed in Section t.2 (Representation of 
prosody by consituency), a prosody running across constituents can be represented as if 
it were a function expanded by these constituents and a pulse can also be represented as 
if \t were a function expanded by the functions affected by the pulse (Section 4.3: 
Representation of pulse by constituency). 

(ii) Borrowing. We may borrow from other systemic "dialects": problems for Nigel 



may not be problems for Fawcett's and Hudson's grammars, for example. We may also 
borrow resources from other grammatical frameworks lo represent aspects of systemic 
theory. Some frameworks are typologically too far away from the systemic one; we are 
not likely to find much of any use to us in transformational grammar. Other 
frameworks are typologically much closer and in some cases there is a long tradition of 
exchange, e.g., stratificational linguistics. One kind of representation close enough to be 
of considerable interest is Kay's Functional Unification Grammar. I have not explored 
it in this paper, but have pointed to places where a FUG treatment is developed in 
Kasper's paper in this volume. Finally, we may borrow representational resources from 
other disciplines which have strong notions of e.g., processes and non-constituency 
organizations. 
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APPENDIX: REPRESENTATIONAL CONVENTIONS 



^i] System network 

(a] The MOOD type system has the entry condition (input features) clause and the 
terms (output features) indicative vs. imperative . If clause has been chosen, either 
indicative or imperative is chosen. 



r— indicative 



clause 



MOOD 
TYPE 



— imperative 



[b] The (DECLARATIVE] SUBJECT PRESUMPTION system has the entry condition 
(the input features) declarative and speaker and the terms (output features) implicit vs. 
explicit . If the features declarative and speaker have been chosen, either implicit or 
explicit is chosen. 
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(c) The MOOD TAG system has the entry condition (input features) declarativ e or 
imperative and the terms (output features) tagged vs. untagged . If the feature 
declarative or imperative has been chosen, either tagged or untagged is chosen. 



declarative 



imperative 




[— tagged 



MOOD 



TAG 



*— untagged 



[d] The CLAUSE COMPLEXITY system has the entry condition (input features) 
d^use or complex and the terms (output features) simplex vs. complex . If the feature 
complex is chosen, the entry condition of the system has been satisfied again. This is a 
so-called recursive system. 



clause 




r- simplex 



complex 
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[ii] Realization statements 

Name of operator 



Description 



(Insert Subject) 



(Withhold Subject) 



(Depend 1 2) 



(Order Subject Finite) 



+Subject 



''Subject 



1 -> 2 



Subject^Finite 



Function inserted as 
constituent of the structure 
of the unit being specified 

Function that is present by 
default is withheld from the 
function structure [Discussed, 
but not implemented 
in Nigel] 

Function 2 inserted as 
interdependent on function 1 
[Used as an illustration, 
but not implemented 
in Nigel] 

One function 
ordered to precede 
another function 



(Include Goal Process) 



(Expand Mood Finite) 



Process <Coal> 



Mood (Finite) 



One function is included 
within another function 
[Sometimes used in 
systemics, but not 
implemented in Nigel] 

One function is expanded 
to have anothe.^ function 
as constituent 



(Conflate Subject 
Agent) 



Subject/Agent 



One function is conflated 
with another function 
to form part of the same 
constituent function bundle 



(Preselect Subject 
singular ) 



Subject: 
singular 



A function is preselected for 
a feature; the realization of 
the function is constrained to 
display that feature 
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Systemic Grammar and 
Functional Unification Grammar 



Robert Kasper 
use / Information Sciences Institute 

1 Background 

As this paper is being presented in the context of a workshop on systemic linguistics, it 
will assume some ff^miliarity with systemic grammar. No familiarity with computer science 
or Functional Unification Grammar will be assumed. This section provides a historical 
perspective to what follows. 

Functionad Unification Grammar (FUG) was first formulated by Kay (1979) as a creative 
method for experimentation in computational linguistics. It is a framework that represents 
a granmiar in a form that is simultaneously readable by linguists and suitable for use 
by computer programs that generate or analyze text. Unlike many previous grammatical 
formaJisms, which required a different representation for analysis than for generation, FUG 
is intended to be neutraJ in this respect.^ In eao'lier work, FUG was also called Functional 
Grammar or Unification Grammar. 

The notation of FUG looks very different from that of Systemic Functional Gram- 
mar (SFG); but if we look beyond the differences of format, we find that FUG and SFG 
make many of the same assumptions about language and grsunmar. The similarities are 
due, in part, to the fact that when Maurtin Kay (1979) formulated FUG he was responding 
to many of Michael HaJIiday's ideas. 

Both FUG and SFG organize a grammatical description around feature choices, stress- 
ing the parauligmatic aspects of language structure. StructuraJ realizations are stated as 
consequences of these feature choices. Both represent the function of constituents explicitly 
in addition to the position of constituents in a sequence. Both provide a means for multi- 
functional description, making possible the expression of textual and interpersonal roles in 
suldition to ideational roles. 

Given these similarities, I was led to consider FUG as a promising tool for constructing 
an analysis program based on systemic graunmar.^ A more careful investigation has shown 
that it is possible to represent most of a systemic grammau' in FUG without any significant 
loss of clarity. Some naturaJ extensions to the FUG frar ^work handle the residue (see 
Section 4.3). Such rerepresentation of a grammar is motivated by Halliday's idea (1964) 
that the form of notation used to t xpress a graunmcur should suit the task for which it is to 
be used. FUG notation is especially well suited to the task of comnutational processing. 
If we aire able to represent SFG in FUG, then we have at our disposal the computational 
tools that have already been developed for FUG. 

^Any modifications to the generation grammar that are needed for the computational analysis program 
should be performed automatically by the program, as explained in (Kay, 1985). 

^BiU Mann originally luggested that FUG might be an appropriate tool for this purpose. 
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The grammar bemg used for the study is Nigel (Mann ic Matthiessen, 1985), a large 
grammar of English that is the result of a collective effort with significant contributions by 
several systemic linguists. This grammar has been used as part of a computer program for 
text generation, but there has previously been no program for text analysis. 

Section 2 provides an introduction to the basic concepts of FUG. Section 3 demonstrates 
how the systems of the Nigel grammar can be represented in FUG notation. Section 4 
explains some of the differences between FUG and SFG. Section 5 shows how the FUG 
framework provides some potential solutions to problems in systemic notation. Sec^i^n 6 
describes a computational analysis program, which uses the FUG representation of the NI;;el 
grammar, and indicates some of its uses. 

2 The Ingredients of FUG 

This section is a brief mtroduction to FUG. For a more thorough treatment the reader is 
referred to Kay's work (Kay 1979, 1985). 

2.1 Functional Descriptions 

In FUG each linguistic object is represented by a functional description (hereafter abbrevi- 
ated as FD). FDs contam lists of attributes, each composed of a label/value pair. Figure 1 
shows a simple FD, which contains three attributes, with the labels Rank, PrimaryClass, 
and Animacy. An equal sign (=) is used to separate the attribute label, which appears m 
the left, from the value, which appears on the right. The order of attributes m a description 
is not usually significant. 

FDs are also called feature structures in the computational linguistics literature. In this 
paper the features of an FD will be called attributes, in order to distinguish them from the 
features of systemic granmiar. 

FDs can be used to represent linguistic objects of any size, such as words, phrases, 
clauses, and even grammars. Each value in Figure 1 is a simple atomic symbol. An atom 
is a string of characters defining a value that is not further decomposed. Note that the 
case of the letters used in atoms can be significant. Thus, AGENT and agent would be 
regarded as different symbols. The values of attributes may also be FDs, allowmg one FD 
to be embedded mside others. Figure 2 shows how embedded descriptions may be used to 
describe groups that are constituents of a clause. The FD of Figure 1 is used as the value 
of the AGENT attribute in the clause description. Since an FD may contain values that 
are themselves instantiated by FDs, a functional description can be viewed as a hierarchy 
of label/value pairs. 



Rank = Group 
PrimaryClasa = Nominal 
Animacy = Animate 



Figure 1: A simple functional description 
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Rank = Clause 

Rank = Group 
AGENT = PrimaryClass = Nominal 
Animacy = Animate 
Rank = Group 
PROCESS = PrimaryClaas = Verbal 
Agency = Effective 



Figure 2: Embedded Functional Descriptions 

2.2 Unification 

Unification is an operation that combines two FDs, reconciling them into a single structure. 
The result of unification is em FD that contzdns all attributes of both of the original de- 
scriptions provided that they are compatible. The criteria of compatibility depend on the 
type of value. Simple atomic values are compt^ible only when they are identical. Values 
instantiated by complex FDs cltc compatible w. an all of their attributes are compatible. 
If a pairticular attribute is present in both FDs, then its values from both FDs are uni- 
fied. Those attributes which are present only in one of the FDs are retained in the result. 
Thus, unification is a recursive process, terminating when all embedded attributes have 
been unified. If any of the attributes fail to unify, then the entire unification is a failure. 

For example. Figure 5 is the FD that results from the unification of the FDs in Fig- 
ures 3 and 4. The attributes having labels Rank and PrimaryClass appear in both FDs and 
they aire compatible, because they have the same atomic value. The remaining attributes 
of each FD do not appear in the other, so they are simply copied into the result. 

2.3 Paths 

Paths are expressions that refer to the values of particular attributes in a FD. A path is 
composed of a sequence of labels enclosed in angle brackets. The value denoted by a path 
is found by a recursive procedure that extracts the value of its first attribute label, and 
applies the remainder of the path to that value. 

Let us illustrate this procedure by considering the value of the path <AGENT Rank> 
when it is applied to the FD of Figure 2. First, we locate the value of the attribute of the 
FD that has the label, AGENT, identical to the first label of the path. The value of this 
attribute is itself a FD composed of three attributes. Next, we use the second label of the 
path. Rank, to locate one of these attributes. Rank = Group, and extract its value. Thus, 
the path refers to the value Group. 

In general, each label of the path is used to locate attributes of successively embedded 
FDs. When the last label of the path has been used to locate an attri>^"te, the value of 
that attribute is taken to be the value denoted by the path. If any of ^.he labels of the path 
cannot be found in the FD, then the path has the special value NIL. 

A path may be used within the description of a grammar as the value of another attribute 
in order to indicate the unification of two attributes. For example, in the FD of Figure 6, 
the path <AGENT> is used as the value of the SUBJECT attribute. This signifies that 
the value of the SUBJECT attribute is to be unified with value of the AGENT attribute. 
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AGENT = 



Rank = Group 
PrimaryClass = Nominal 
, Animacy = Animate 



Figure 3: FD for AGENT. 



SUBJECT = 



Rank = Group 
PrimaryClass = Nominal 
Number = Singular 
Person = 3 
Lex = John 



Figure 4; FD 



AGENT = SUBJECT = 



for SUBJECT. 

Rank = Group 
PrimaryClase = Nominal 
Animacy = Animate 
Number = Singular 
Person = 3 
Lex = John 



Figure 5: Unification of AGENT and SUBJECT. 



Rank = Clause 

Rank = Group 
AGENT = PrimaryClass = Nominal 

. Animacy = Animate 
SUBJECT = <AGENT> 



Figure 6: FD containing a path as the value of an attribute. 



2.4 Patterns 



A special type of attribute, which has the label pattern^ describes how a function is realized 
by its constituents. Patterns specify the ordering of constituents with respect to each other. 
Pattern values contain a list of elements enclosed by parentheses. An element of a pattern 
may be the label of an attribute, which stands for a constituent function, or a special symbol 
(ellipsis) that indicates that any number of constituents may intervene in the pattern at 
that point. 

For example, the pattern shown in Figure 7 specifies that SUBJECT and PROCESS are 
constituents of the clause represented by this FD, that SUBJECT is ordered inmiediately 
before PROCESS, and that any number of other constituents may come before SUBJECT 
or after PROCESS. More complicated patterns are possible, but the details will be omitted 
here. 

2*5 Disjunctions 

FDs may also contain disjunctions, which represent sets of alternative descriptions. The 
alternatives of a disjunction are surrounded by curly braces (i.e., {, }) in the most conmion 
notation, as shown by the value A in Figure 8. When tne disjunctive value. A, is unified 
with another value, B (sdso shown in Figure 8), at least one of the alternatives of A must be 
compatible with B in order for the unification to be successful. If two or more alternatives 
are compatible with B, then the result of the unification is a disjunction containing the result 
of unifying B with each compatible alternative. Only one alternative of A is compatible 
with C. In this case, the result is a non-disjunctive value formed by unifying C with that 
alternative. 

2*6 Representing a Grammar 

We now have enough b&. ingredients to state how a grammar can be represented by an 
FD. For each unit described by the grammar (such as a clause or group), the grammar 
will contain a FD giving its attributes. In general, this FD will be quite large, containing 
alternatives for all possible values of each attribute. In systemic terms, each attribute 
corresponds to the description of a feature or a function. An attribute that describes a 
feature usually has an atomic value. An attribute that describes a function usually has a 
complex FD as a value, giving the attributes of the constituent which realizes the function. 
By representing all possible alternative values for each attribute, an FD can be written to 
express the grammatical potential of a language, in a way analogous to a system network. 



Rank = Clause 

pattern = (... SUBJECT PROCESS ...) 
Rank = Group 
PrimaryClass = Nominal 
Rank = Group 
PrimaryClass = Verbal 



SUBJECT = 



PROCESS = 



Figure 7: FD containing a pattern attribute. 
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A 
B 

A 

C 
A 



=11 



= B = 



= C = 



number = singular 
person = 1 

number = singular 
form = unmarked 

number = singular 
person = 1 
form = unmarked 
number = plural 
foim = unmarked 
\ number = plural 
fcim = unmarked 



number = singulsir 
person = 2 



[ number = plural j | 



number = singular 

person = 2 

form = unmarked 



Figurv? 8: Unification with Disjunctive Values 

3 Representing Systemic Grammar in FUG 

This section demonstrates by example how systemic grarrunar can be translated into the 
notation of FUG. Figure 9 shows the feature choices for two systems (MoodType and Indlca- 
tiveType) from the clause network of the Nigel grammar. Figure 10 shows the translation 
of these systems into FUG notation. 

3.1 Systems and Feature Choices 

Each systemic choice is encoded by a disjuncion that expresses the alternatives of the 
system. Each alternative of the disjunction conteiins an attribute with one of the systemic 
features that v.an be chosen in that system as its value. The system name is used as the 
label of that attribute. Notice that the symbol *:=' is used (in place of '=') to designate aU 
attributes that correspond to systemic features. The reason for this notation k explained 
in Section 4.3. 

The alternatives may also contain consequences of choosing a particular feature, includ- 
ing realizations and the representation of other systems that may be entered when that 
feature is chosen. Thus, delicacy can be expressed in FUG by embedding. The Indica- 
tiveType system is entered when the Indicative feature is chosen in the MoodType system, 
and it is represented by a FD embedded within the description of MoodType. 

3.2 Realization 

Figures 11 and 12 add the realization components of these two systems. This section 
explains the representation iu FUG notation of five common systemic realization operators: 
insertion, preselection, classify, order, and conflation. 

Insertion is indicated in FUG by a pattern containing the name of the inserted function. 
For example, when the Imperative feature is chosen, the NONFINITIVE function is inserted. 
This pattern states that the function must be present as a constituent of the clause, but it 
does not place any constraint on how NONFINITIVE may be ordered with respect to other 
constituents of the clause. 
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r-ClaU8€- 



MOOD 



RANK 



[-Imperative 



, INDICATIVE 
-Indicative — TYPE 



-Declarative 



-Interrogative 



Figure 9: The MoodType and IndicativeType Systems 

Rank = Clause 

[ MoodT>pe := Imperative ] 
MoodType := Indicative 

([ IndicativeType := D.c ative ] 1 
[ IndicativeType := Interrogative ] J 



Figure 10: The MoodType and IndicativeType Systems in FUG 



Clause - 



MOOD 
TYPE 



-Imperative 
+NONFINITIVE 
NONFINITIVElStem 



Int 



INDICATIVE 



TYPE" 



^SUBJECT 
-hFINITE 

SUBJECT:Nominative 



-Declarative 
SUBJECT A FINITE 



'-Interrogative 



Figure 11: MoodType and IndicativeType Realizations 

Rank = Clause 

r MoodType := Imperative 1 

pattern = (. . . NONFINITIVE . . .) 
L NONFINITIVr; = [ Form = Stem ] J 

MoodType := Indicative 

pattern = (. . . SUi^JECT . . .) 

pattern = (...FINITE...) [ 
uUBJECT = [ Case = Nominative ] 

(IndicativeType := Declarative ^ 
, piiUern (...SUBJECT FILITE ...) J > . 
[ IndicativoType := Interrogative ] ) i ) i 



Figure 12: MoodType and IndicativeType Realizations in FUG 
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Preselection is translated into FUG by specifying an FD for the given function. This 
FD constrams one attribute of the function so that it must have the value of the preselected 
feature. For example, when Indicative is cho8#*n, the SUBJECT of the clause is preselected 
to have the Nominative feature. In Figure 12, the attribute labeled by SUBJECT has a 
value that is an FD containing the statement Cast = Nominative. The label. Case, is 
taken from the name of the system that selects the fe^iture. Nominative, in the grammar 
for nominal groups. 

ClaBsify, the operator in Nigel that preselects lexical features, is translated by the 
same method as preselection, as shown by the classification of NONFINITIVE as a Stem. 
There is no formal distinction between lexical features and grammatical features in FUG, 
so preselection and classification are represented in exactly the same way. This is as things 
would be if Nigel used r, fully systemic lexicon. 

Order is tranalated into FUG by a pattern that contains the names of the functions in 
the required order. For example, when the Declarative feature is chosen, an ordeiing relation 
is specified requiring SUBJECT to immediately precede FINITE. The dots on either side of 
these functions are necessary, because this ordering relation does not specify any constraint 
on what may occur t: the left of SUBJECT or to the right of FINITE. 

Conflation is encoded precisely by unification. When two functions aie conflated, they 
become identified as a single function bundle sharing all properties of both functions. The 
same ' nd of identification is the result of unifying two FDs, as illustrated in Figure 5. This 
unification corresponds to the conflation of AGENT with SUBJECT. A single FD is created 
to represent the value of both functions. 

Table 1 summarizes how the realization operators of SFG can be translated into FUG. 
Different types of ordering can be expressed as different pattern configurations, as shown 
in Table 2. 

During a first experiment, approximately 80 systems from the clause rank of the Nigel 
grammar were translated into FUG. Subsequently, a program was implemented to auto- 
matically translate any system network into a functior de3cripti-)i. This program verifies 
the effectiveness of the method outlined here for translating \ system network into FUG 
notation. In j^ractice, it is convenient to translate each rank of a granunar into a separate 
FD. 

TVanslatiug the information of individual systems presented very few problems. The 
most significant problen^s have been encountered in making sure that the dependencies 
between different systems are adequately expressed. The methods used in solving these 
problems are described in st^ction 4.3. 

4 How FUG Differs from Systemic Grammar 
4.1 Purpose 

The primary difference between FUG and SFG is one of purpose. FUG is a method for 
representing granrmiars so that they may be used by computational tools, while SFG em- 
bodies a part^'^'ilar theoretical model of language. FUG allows a great deal of flexibility; 
it can easily encode mai^y different types of grammatical information as attributes, since 
attributes may contain as much information and embedded structure as needed. By I v ng 
many options open in this way, the FUG framework by itself does not attempt to s.rv^ 
as a complete theory of language. Rather, FUG is a vehicle for expressing grammatical 
mformation in a way suitable for use by a variety of computational processes. FUG and 
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SFG realiiation 



FUGdc8crip*.ron 



Presekct 
Classify 



Lcxify 
Conflate 



Insert 



-h SUBJECT 
SUBJECT : Nominative 
FINITE ! Singular 
FINITE = has 
SUBJECT / AGENT 



pattern = (. . . SUBJECT . . .) 
SUBJECT = [Case = Nominative] 
FINITE =^ [Number = Singular] 
FINITE = [Lex = has] 
SUBJECT = <AGENT- 



Table 1: Summary of FUG equivalents for SFG Realization Operators. 



REALIZATION OPERATOR FUG PATTERN 



Order 
Partition 
Order AtFVont 
Order AtEnd 



SUBJECT A FINITE (. . . SUBJECT FINITE . . .) 

FINITE I POLARITY (. . . FINITE . . . POLARITY . . .) 

I^THEME (THEME...) 

TOPICAL*! (. . . TOPICAL) 



Table 2: Realization Operators for order expressed in FUG. 



closely related models (e.g., FATR-H, described by Shieber, 1984) have been used to repre- 
sent grarmnaTS originating from several linguistic traditiom, including Systemic Grammar, 
Generalized Phrase Structure Grammar eind L'»>'ical Functional Grammar. 

4.2 Notational Differences 

Some superficial differences of notation exist between systemic grammar and its encoding 
in FUG. The most notable differences are: 

1. System names are used as attribute labels, whenever a systemic feature is uso^ 
as a value in an FD, as noted in Section 3. This use of system names is redundant 
in systemic granrniar , because the normal practice is to give a unique name to every 
feature in the graxmnar. In FUG the same value might appear with many diJerent 
attributes, so it is necessary to specify a label. There is no such thing as an unlabeled 
feature in FUG. 

2. Order operators are expressed as pattern configurations, as shown in Table 2. 
The patterns of FUG provide a general framework for describing order constraints. 
Several alternative methods have been developed as a means of using these patterns 
in actual computer programs. I use the pattern notation here because of its clarity; I 
will not elaborate on the specific details of how patterns £ire handled computationally. 

3. Encoding of Features and Functions. Features and functions are both encoded 
by the values of attributes in FUG. A feature is encoded by an atomic value, and a 
function is encoded by a value that is an FD, specifying the attributes of the con- 
stituent that realizes the function. The attributes corresponding to functions can be 
precisely identified by their occurrence in constituent patterns. Note that as a matter 
of convention, the labels for attributes corr .^sponding to systemic functions are written 
in upper case. 

These differences stem from the generality of FUG aj a representation for grammatical 
information. 
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4.3 Extensions to FUG Motivated by SFG 

The translation of systemic grammar into FUG lias suggested two extensions to the FUG 
framework: 

1. designating attributes that correspond to systemic features, 

2. distinguishing selection conditions that coirespond to complex input conditions of 
systems. 

Th^ first extension was already illustrated in the examples of Section 3. The second exten- 
sion will be introduced later in section 4.3.2. 

4.3.1 Designated attributes for systemic features 

The attributes corresponding to systemic features are designated by using the notation 
for unification in place of the usual notation. This special designation is necessary to 
insure that aU of the dependencies between different systemic features are enforced. For 
example, consider the IndicativeType system in the network of Figure 11. Both of the 
following implications are intended for this network. 

1. If a clause has the feature Indicative, then it must also have exactly one of the features 
from the IndicativeType system (either Declarative or Interrogative). 

2. If a clause has one of the feature^ from the IndicativeType system (either Declarative 
or Interrogative), then it must also have the feature Indicative. 

Now consider the FUG representation of this network, which is shown in Figure 12. 
If the attributes correspoikding to the systemic features were not designated with the 
notation, only the first of the two implications would holf^. It would be possible to unify 
a FD containing the attribute IndicativeType = Declarative with the alternative of the 
gramxnar that contains MoodType = Imperative, because there is no attribute with the 
label IndicativeType in that alternative. Remember that when two FDs are unified, any 
attributes present m only one of the FDs are simply copied to the resulting FD. 

A two-way dependency exists between the features of th3se two systems, but only 
one of the ways is encoded in the FUG representation. Only some of the attributes 
in a FD should have this kind of two-way dependency. For instance, the attribute 
pattern = (. . . SUBJECT FINITE . . .) does not imply MoodType = Indicative, although 
it occurs in the same alternative with the iLttnbute IndicativeType — Declarative, It is not 
possible to reliably determine which attributes of a FD should have the two-way dependency, 
unless those attributes are explicitly marked in some way. 

By giving a special designation to the attributes that correspond to systemic features it 
is possible to enforce both of the desired implications. The following interpretation is given 
to designated attributes. The presence of an attribute with the notation implies that 
any disjunctive FD that contains this attribute must have a valid alternative that contains 
this attribute. Therefore, the presence of a designated attribute (i.e., IndicativeType) also 
implies that all other such designated attributes in the surrounding portions of the descrip- 
tion (i.e., MoodType) must have the value (i.e.. Indicative) that is given in the alternative 
containing this attribute (IndicativeType). Usually at most one attribute of each alterna- 
te e of a disjunctive FD will be marked as a designated attribute, and this attribute wiii 
correspond to * systemic feature. 
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4.3.2 Difltinguishing selection conditions for complex input conditions of sys- 
tems 



Martin Kay's formulation of FUG [Kay79] pro/ides a natural way to express the dependen- 
cies between systems by embedded FDs. For e; ample, the system IndicativeType has input 
condition Indicative and Indicative is a feature chosen in the system MoodType. In FUG, 
the FD corresponding to the IndicativeType system is embedded within the alternative of 
MoodType where Indicative is chosen. The method of embedding, combined with desig- 
nated attributes, works well for simple input conditions, but it does not provide a solution 
for inpat conditions made up from complex boolean expressions. 

Consider, as an example, the DoingType system from Nigel, shown in Figure 13. Its 
input condition is the conjunction of two features. One way to express this input condition 
in FUG is to use a disjunction having two alternatives, as shown in Figure 14. One al- 
ternative corresponds to what happens when the DoingType system is entered. The other 
alternative corresponds to what happens when the DoingType system is not entered, '^he 
alternative containing the DoingType system also includes the attributes of the input con- 
dition. The other alternative includes the attributes of the negated input condition. Notice 
that the input condition and its negation must both be stated explicitly, unlike in sys- 
temic notation.^ This method of encoding input conditions also requires a computational 
treatment of negated attributes. Although such a treatment of negation is possible, it can 
increase the complexity of unification (see Karttunen, 1984). 

It should not be necessary to state the negated input condition explicitly, since it can 
always be derived automatically from the unnegated condition. It is also rather inefficient 
to mix the attributes of the input condition with the other attributes of the system. The 
attributes of the input condition contain exactly the information that is needed to choose 
between the two alternative's of the disjunction (i.e., to choose whether the system is entered 
or not). Therefore, it woi be more efficient and less verbose to have a notation in which 
the attributes of the input condition are distinguished from the other attributes of the 
system, and in which those attributes only need to be stated once. 

In order to handle complex input conditions we have developed an extension to FUG 
that uses the input condition as a selection condition on alternatives of a disjunctive FD. 
This selection condition is expressed as an FD, but it is a description that has a new kind 
of interpretation in the context of the entire grammar. When the condition is satisfiable 
(i.e., when aM of its attributes can be unified with the values specified for them) then 
an alternati^ e containing the FD corresponding to the system is chosen. Otherwise, an 
alternative that does not contain the FD of the system is chosen. This second alternative 
may be omitted in many cases. The use of a selection condition for the DoingType system 
is illustrated in Figure 1[>. The FD of the selection condition appears to the left of the — > 
symbol, anc^ the FD to be used when the selection condition is satisfiable appears to its 
right. 

4.3.3 Summai^' of extensions 

The two extensions dt^scribed here may have a rather general application. In both cases 
there is a way of eno^ding the same information in terms of standard unification, but it 

^If the negation of tlie input condition were not included in the alter*'! at iVe that does not include the 
description of the systeia, it would be possible to use this alternative even when the input condition for the 
system holds. Thus the description of the system would not be used when it should ' e. 
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TYPE 
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PROCESSrDispositive 



Figure 13: The DoingType System 

Rank = Clause 

Agency = Effective 
ProcessType = Material 

DoingType := Creative 
, PROCESS = [ Type = Creative ] 
DoingType := Dispositive 
PROCESS = [ Type = Dispositive ] 
Agency = NOT Effective ] 
ProcessType = NOT Material ] 
DoingType := none 
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Figure 14: DoingType system in FUG, using disjunction and negation. 
Rank = Clause 

DoingType := Creative 



Agency = Effective 
ProcessType = Material 



PROCESS = [ Type ^ Creative ] 
DoingType := Dispositive 
PROCESS = [ Type = Dispositive ] 



Figure 15: DoingType system in FUG, simplified by using a selection condition. 
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requires at least doubling the amount of information which encodes that information in the 
systemic description. Therefore, by adding these extensions, we havt not changed what FUG 
can do, but we have developed a representation that is more perspicuous, less verbose, and 
computationally more efficient. It appears that these mechanisms may be useful additions 
to the FUG framework, even when it is not used to directly model a systemic grammar. 
They suggest that a general treatment of logical implication between FDs might be useful 
in FUG. One of the most inefficient aspects of the standard unification procedure is the 
selection of alternatives in disjunctions. The selection condition provides a way of singling 
out attributes that may be particularly significant in making a selection, so that those 
' ttributes can be examined before attempting to unify the rest of the attributes. 

Two factors in this study have led to the need for these extensions. The first is, quite 
naturally, the particular ways in which features are organized in a systemic description. 
The second is the size of the grammar. Most previous work with FUG has concentrated on 
the representation of particilar phenomena occuring in natural language, ^ut few, if any, 
attempts have been made to use FUG for a large grammar (i.e., one whic^ approximates 
an entire language). These extensions may not be important for small grammars, but they 
seem to become crucial as the size of the grammar grows. 

5 Solutions to Problems in Systemic Notation 

M atthiessen (in the first part of this report) indicates several problems in developing a more 
formal algebraic account for phenomena that have been described by systemic linguists. As 
mentioned in Section 4, FUG offers some flexibility, and this flexibility might be used to 
augment systemic notation. Here, two of the problems presented by Matthiessen will be 
addressed: discontinuous constituents and feature propagation. 

5.1 Discontinuous Constituents 

The problem of discontinuity arises when the constituents that realize a particular function 
are not sequentially continuous. Matthiessen has indicated that it would be desirable to have 
a method of simultaneously representing functional continuity and sequential discontinuity. 
FUG provides such a method, since there is no stipulation that a FD must represent a 
sequentially continuous unit. 

A common type of discontinuity is a split process in a clause, as exemplified by They 
c died the meeting off* Figure 16 shows the desired analysis for this clause according to 
Matthiessen.^ Figure 17 shows how this analysis can be represented in FUG. 

In the FUG representation, a discontinuous constituent (PROCESS) of a CLAUSE 
contains two subconstituents (PREDICATOR and ADJUNCT), each of which is continuous. 
Each subconstituent is functionally a part of the PROCESS constituent, but sequentially 
ordered with respect to other constituents of the CLAUSE, as specified in the pattern, 

(. . . PREDICATOR COMPLEMENT ADJUNCT . . .)• 

The FUG description accounts for both transitivity structure and mood structure. The 
information about transitivity structure is contained in the description of the PROCESS 
attribute, and the information about mood structure is contained in the PREDICATOR and 

^The analysis is adapted from Halliday (1985, p. 186). 
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They called the meeting oflF 



IVansitivity 


AGENT 


PRO- 


MEDIUM 


-CESS 


Mood 


SUBJECT 


PREDICATOR 


COMPLEMENT 


ADJUNCT 



Figure 16: Desired Analysis of Split Process 



CLAUSE GRAMMAR contains: 



Rank = Clause 

pattern = (. . . PREDICATOR COMPLEMENT ADJUNCT . . ) 

' ProccssType = Material 
PROCESS = PREDICATOR = [ Class = Verb ] 

. ADJUNCT = [ Class = Adverb ] 
PREDICATOR = <PROCESS PREDICATOR> 
ADJUNCT = <PROCESS ADJUNCT> 
PrimaryTense = Past 
FINITE = [ Form = Past ] 
PrimaryTense = Present 



PrimaryTense = Future 



SecondaryTense = none 
[ IndicativeType = Declarative 



[ FINITE = < PREDICATOR > ] 



LEXICON contains: 



ProcessType = Material 
•caU off' = Predicator = [ lex = call ] 
. Adjunct = [ lex = off ] 
Class = Vfrb 
lex = caU 



«caU» = 
"off" = 



Class = Adverb 
lex = off 



Figure 17: FUG Description of Split Process 
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ADJUNCT attributesc In general, FUG is able to handle almost any sort of multifunctional 
description. 

The description also shows how the choice of particular features in the clause granmiar 
adds attributes selectively to the realization of particular subcoustituents. When the clause 
is declarative and the tense is simple (there b no secondary tense), then FINITE is conflated 
with PREDICATOR. Thus the inflectional form is added to PREDICATOR, but not added 
to ADJUNCT, aI:hough they both function as part of PROCESS. 

This solution to the problem of discontinuous constituents also indicates a method for 
describing compound lexical items, or lexical items realized by phrasal units. The lexical 
item for the compound unit call- off separate attributes for each of the simple lexical 
items, call and off, Mhen call-off is selected as the realization of PROCESS, its FD is 
unified with the FD of PROCESS in the clause grammar. As a part of this unification, the 
attributes for PREDICATOR and ADJUNCT are automatically supplied with appropriate 
lexical realizations.^ 

5.2 Feature Propagation 

Common systemic notation provides one way to specify features that must be selected 
for a constituent function: preselection. There is no way to specify that two constituents 
must have some features in common without preselecting those features independently for 
each constituent. In other words, there is only one type of feature propagation between 
constituents, the top-down propagation specified by preselection. Because unification is a 
method for specifying when particular attributes must share the same value, it can be used 
to produce the effect of more general feature propagation. 

An example of feature propagation between different constituents of the same rank is 
agreement It will be illustrated here by agreement between the subject and finite verb of 
a clause. At clause rank we would like to specify simply that the finite yeiu Las the same 
features of person and number as the subject. This relation should hold independently of 
the particular method that is used to select those features for the subject. A solution of 
this problem in terms of FUG is shown in Figure 18. 

In this solution all attributes that must agree ^**e grouped into a single FD, which is 
labeled Agr, For English, the Agr attribute would include the features for person and 
number with the alternatives: 



In other languages, agreement might include attributes for gender or case. With this group- 
ing, it is possible to specify agreement at clai'^e rank simultaneously for all of the appropriate 
attributes. 

The grammar of the nominal group specifies the criteria for choosing the values of the 
agreement attributes, and these values are unified with the Agr attribute for the group. For 
simplex nominal groups, the Agr attribute of the group has exactly the same values as the 
THING function of the group, as indicated by the unification, THING = [Agr = <Agr>j. 
The path <Agr> denotes the value of the A^r attribute for the entire FD of the nominal 

^For an analysif pre vam it is necessary to provide a more detailed description, so that the composite 
lexical item call- off c^n be reco mixed. One way to do this is to have a sense of the word call that requires it 
to be a part of a compound lexical item that contains the adjunct off. Other senses of call would not require 
an adjunct. The design of phrasal lexicon involves many issues that go beyond the scope of this paper. 



Agr = 



Person = {12 3} 

Number = {singular plural} 
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CLAUSE GRAMMAR contains: 



Rank Clause 

<FINITE Agr> = <SUBJECT Agr> 
. SUBJECT = [ Rank = NounGroup ] ^ 

NOMINAL GROUP GRAMMAR contains: 



Rank = NounGroup 

Complexity = Simplex 
[ THING = [ Agr = <Agr> ] . 
Complexity = Complex 
< ( ComplexType = Addition 
I Agr = [ Number = Plural ] 
I r ComplexType = Alternation 




Figure 18: FUG Description of Agreement. 

group. For complex nominal groups of the additive type, the Ayr attribute for the nominal 
group wiU have the value [Number = Plural]. 

When a nominal group realizes the SUBJECT function of a clause, these same values 
will also be added to the Agr attribute for FINITE, because the paths <FINITE Agr> and 
<SUBJECT Agr> are unified in the clause grammar. Thus the FINITE function must be 
realized by a constituent having an Agr attribute that is compatible with the Agr attribute 
of the SUBJECT. 

The solution provided should be applicable to many different types of agreement phe- 
nomena. For example, agreement in tag questions could also be handled using the same 
method. Similar techniques could be used to describe other prosodic phenomena. 

These two examples show that translation to FUG can provide a more formal elaboration 
of ideas that have already been developed informally in systemic theory. Several other 
notational problems also appear to have a reasonable solution in terms of FUG. For instance, 
other kinds of interaction across cycles of the granmiar are also possible. FUG allows the 
ranks of systemic grammar to be encoded, using an FD for each rank. However, there 
are no restrictions on the interdependencies that can be created between ranks, since the 
frar work allows any two attributes of any two FD's to be unified (provided they are 
compatible) . 
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6 A Grammatical Analysis Program 



The grammatical analysis program, which has been under experimental development at 
USC/ISI, performs the following tasks: 

1. segments a sentence into its constituents, 

2. assigns functions to each constituent, 

3. determines the set of systemic features chosen at each rank ^produces a selection 
expression). 

The first two steps represent syntagmatic analysis, and the last step represents paradigmatic 
analysis. These steps are not necess£Lrily performed in the order given above, rather they 
are usually performed in an interleaved fashion. Usually particular features need to be 
recognized before assigning a function to a constituent, and the properties of a function 
may lead to the determination of addition2d features. 

6*1 How the program works 

The type of analysis program most often used with FUG is called a chart parser, A good 
description of how a chart parser works can be found in Winograd's text (Winograd, 1983, 
pp. 116 - 127). A brief summary is all that can be included here. 

A chart is a kind of bookkeeping mechanism, which stores all the information that is 
known about constituents that have been recognized. It contains vertices representing points 
in the sentence, and edges representing constituents (and partially recognized constituents). 
The vertices are placed between the words of the sentence, and are labeled with indices, 
beginning with the index 0 to the left of the first word and ending with the index n after 
the n^^ word. Each edge connects a pair of vertices and represents a constituent that is 
composed of the words between those vertices. When a chart is used for FUG, the edges 
also contain FDs for the constituents that they represent 

The program begins to construct the chart by inserting an edge corresponding to each 
word of the sentence. A lexicon contains an FD for each sense of a word. These FDs contain 
attributes for the lexical features of the word. An edge containing a FD for the n^^ word 
would have a starting vertex n — 1 and an ending vertex n. The FD for a word is thus made 
available to the parser, as it attempts to build larger constituents. In order to build the FD 
for a constituent, edges from the chart are matched with the descriptions of functions in 
the grammar. This matching is accomplished by unifying the FD of an edge with the FD 
for a function. The functions are selected from the grannmar, so that they match edges in 
an order that is consistent with the pattern attributes of the grammar. 

The program proceeds (in a bottom-up manner), recognizing groups first, and creating 
a FD for each group. The FD for each group is then unified with the FD's for functions 
in the clause grammar. This is the sequence that would be followed for simple clauses. 
A different sequence would be followed to recognize constructions that result from rank 
shifting. For example, when a clause occurs as a constituent of a group, then the program 
would first build an FD for the clause and then unify this FD with the FD for the function 
of the group that has the attribute Rank = Clause in its description. 

^The program extends the PATR-II system, which has been developed by Shieber et al. at SRI Interna- 
tional (Shieber, 1984). 
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Provided with a method for interpreting the constituent patterns/ the chart parsing 
algorithm and the unification algorithm specify all the work necessary for grammatical 
analysis of FUG. The chart parsing algorithm determines a strategy for selecting a particular 
FD from the grammar and the FD for some constituent that has already been recognized. 
Then, the unification algorithm determines whether these two FDs match, and builds a 
more complete FD if they can be unified. When the process builds an FD of clause rank 
covering the entire sentence, then a complete analysis for the sentence has been formed. 

6.2 Potential Uses of the Program 

The resources offered by a computational analysis program based on systemic functional 
granmiar include: 

1. a grammar testing and research tool, 

2. text analysis in bulk quantities, 

3. a functional approach to natural language processing. 
6.2.1 Grammar testing and research tool 

Using a grammar in a computational environment requires a degree of precision that may 
not be necessary in descriptions written for human consumption. It forces the linguist to 
develop an explicit interpretation of its statements where intuition is replaced by rigour. The 
grammar becomes more accurate as inconsistencies are detected by a computer program 
that does not know what the linguist has in mind. A computational program can also 
reveal examples that are given an analysis other than what was intended by the linguist. 
The computer cannot correct most mistakes, but it can locate areas of the granmiar that 
need more attention. 

I am not proposing that a computer program can replace linguistic intuition, rather that 
it can help verify and refine linguistic descriptions. It can provide an answer to the question, 
"Does this system network really work?" When a network gets large it becomes practically 
impossible for any person to enumerate all possible combinations of feature choices. It may 
even take a long time for a computer to do this, but computers do not get tired as people 
do. 

This type of analysis program allows a linguist to study aspects of a grammar that have 
been previously inaccesible. It opens up the possibility of validating the grammar with a 
much larger body of data than can be done by an individual linguist. Such grammar testing 

^In order to recognixe constituents in an efficient manner, the patterns in an FD must be ^mpiled into an 
appropriate form. There is a considerable variety of techniques from computer science that can be applied 
to this task. Kay (1985) gives one method for compiling the patterns into augmented transition networks. 
Another method involves using a set of phrase structure rules to represent the patterns. This is the method 
used in the PATR-II parser. The phrase structure rules are used as a processing technique, but they are noi 
significant in the final descriptions produced by the parser. 

Using the PATR-U parser, it is not necessary to convert every pattern into explicit phrase structure rules. 
Some of the patterns may be converted into statements about attributes that mark the start and end of each 
constituent, corresponding to its endpoints in the chart. For example, in order to encode the pattern (1) 
the unification (2) can be used. 

(... SUBJ FINITE ...) (1) 
< SUBJ end > = < FINITE start > (2) 



O 18 ^ . 

ERIC SU 



can provide information about the following areas which are described below: coverage, 
delicacy, consistency, ambiguity, and rccoverability. 

Coverage: Can the granmiar recognize "x"? 

The program detects sentences which should be recognized, but which the current 
expression of the grammar does not include. 

Delicacy: Is the set of features adequate for describing "x"? 

A superficial grammar will have no features to distinguish sentences that should be 
given different interpretations. For example, is the position of an adverbial group 
in a clause functionally significant, or arbitrary? By enumerating the features that 
are actually assigned to a particular clause by the grammar, an analysis program 
may show that a feature has been assigned incorrectly, or it may become clear that a 
feature is missing. Automatic analysis allows an examination of the consequences of 
treating such variations as either accidental or purposeful. 

Consistency: Can all lines of analysis be reconciled? 

One of the strengths of systemic grammar is its multifunctional description of lan- 
guage. While desirable, multifunctionality makes it hard to determine whether the 
grammar is consistent. It is important to know whether the analyses given by the 
different metafunctions are compatible with each other. If there are incompatibilities, 
the analysis program will not be able to form a unified description, and it will show 
some attribute(s) that fail to unify. 

Another type of inconsistency can result from the large number of systems in a gram- 
mar. The systems are, in general, highly interdependent, so it is not always easy to 
grasp what effects a change in one system will have on the grammar as a whole. The 
computational analysis of specific sentences brings out cases where the features of one 
system do not match the expectations of another. 

Ambiguity: How many alternative analyses does the grammar produce for "x"? 

It is common for an untested grammar to produce many alternative analyses for some 
sentences. Only some of these alternatives may be intended by the grammar writer. 
For example, a fragment of the Nigel grammar produced two analyses for the subject 
of the clause "Norman is running." 

1. SUBJECT = MEDIUM = "Norman" 

2. SUBJECT = RANGE = "Norman" 

Clearly, only the first interpretation is correct, but the fragment of the grammar 
that was being tested did not cont^m enough information to eliminate the second 
interpretation. The program helps detect such unintended analyses, 

Recoverability: Which grammatical functions can be determined for "x"? 

Some grammatical functions can be determined only from the context in which a 
sentence is used. Using the analysis program, it is possible to see which functions 
cannot be recovered by the grammar. By isolating the difficult cases, it is easier to 
decide whether the grammar needs to be refined to make better use of the information 
that it has, or whether the dissmibiguating information must be found in the extra- 
sentential context. 

81 
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6.2.2 Text analysis in bulk quantities 



The idea of developing an automated corpus for language studies has interest for many 
scholars, including some systemic linguists. A prograri that uses an explicit representation 
of a grammar offers a queditative difference from previous work (e.g., syntactic tagging) by 
developing a more complete account of grammatical features, structures and functions. 

6.2.3 Functional approach to natural language processing 

In any computer application that requires simulating some degree of understanding human 
lantjage (e.g., as a method of extracting information from a large database), it is necessary 
to determine the functions of each constituent in a sequence. Many programs accomplish 
this task using a limited grammar for specific patterns that occur most often, but it is 
difficult to extend these grammars to new domains of discourse. A program that has the 
capability of using any systemic grammar as a basis for language analysis would be able 
to provide a more general and comprehensive account of granimatical functions than most 
programs that are currently used for natural language processing. 

7 Conclusions 

We have shown some ways in which rerepresentation and computational analysis cnn be used 
as a testbed for refinement of a systemic grammar. The rerepresentation in the notation 
of FUG shows that systemic granimars can be made formally explicit, and it is a way 
of extending the computational techniques that may b*^ applied with the grammar from 
sentence generation to sentence analysis. 

A trial version of the computational analysis program described in Section S has been 
implemented. This program is still under development. Several refinements are necessary to 
improve its efficiency, «nd make it available for the practical uses that have been mentioned 
here. 

The comparison of SFG with FUG has indicated some ways in which both frameworks 
may be enriched. The task of representing a large systemic granrmiar in FUG has shown 
where some extensions to FUG are necessary, if it is to be used for grammars approxi- 
mating those of real human languages. In turn, the solutions to notational problems in 
terms of FUG have shown some ways in which systemic notation might be enriched. This 
study has been beneficial for both frameworks; the tools of FUG can be used to improve 
systemic descriptions by testing and refinement, and applying FUG to systemic granunar 
has sharpened it as a tool. 

The FUG notation also enables a cn-^parison of systemic grammar with other models 
that use feature-based descriptions, and could make the insights of systemic descriptions 
available to a wider conmiunity of scholars. 
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